Engineered immunoglobulin heavy chain-light chain pairs and uses thereof

ABSTRACT

The present invention provides heterodimer pairs that can comprise a first heterodimer and a second heterodimer wherein each heterodimer comprises an immunoglobulin heavy chain or fragment thereof and an immunoglobulin light chain or fragment thereof. At least one of the heterodimers can comprise one or more amino acid modifications in the CH1 and/or CL domains, one or more amino acid modifications in the VH and/or VL domains, or a combination thereof. The modified amino acid(s) can be part of the interface between the light chain and heavy chain and are typically modified to create preferential pairing between each heavy chain and a desired light chain such that when the two heavy chains and two light chains of the heterodimer pair are co-expressed in a cell, the heavy chain of the first heterodimer preferentially pairs with one of the light chains rather than the other. Likewise, the heavy chain of the second heterodimer typically preferentially pairs with the second light chain rather than first.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is the divisional of U.S. patent application Ser. No. 14/648,222, filed May 28, 2015, which is the U.S. National Entry of PCT/CA2013/050914, filed Nov. 28, 2018, which claims the benefit of U.S. Provisional Application No. 61/730,906, filed Nov. 28, 2012, U.S. Provisional Application No. 61/761,641, filed Feb. 6, 2013, U.S. Provisional Application No. 61/818,874, filed May 2, 2013, and U.S. Provisional Application No. 61/869,200, filed Aug. 23, 2013, the entire disclosure of each of which is hereby incorporated by reference in its entirety for all purposes.

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jan. 30, 2014, is named 24897PCT_CRF_sequencelisting.txt, and is 91,025 bytes in size.

BACKGROUND

Bi-specific antibodies are capable of binding to two different epitopes. The epitopes can be on the same antigen, or each epitope can be on a different antigen. This feature of bi-specific antibodies makes them an attractive tool for various therapeutic applications where there is a therapeutic benefit to targeting or recruiting more than one molecule in the treatment of disease. One of the approaches to form bi-specific antibody would involve concomitant expression of two unique antibody heavy chains and two unique antibody light chains. Correctly forming bi-specific antibodies in a format that is similar to wild-type remains a challenge, since antibody heavy chains have evolved to bind antibody light chains in a relatively promiscuous manner. As a result of this promiscuous pairing, concomitant expression of two antibody heavy chains and two antibody light chains naturally leads to a scrambling of heavy chain-light chain pairings. This mispairing remains a major challenge for the generation of bi-specific therapeutics, where homogeneous pairing is an essential requirement for good manufacturability and biological efficacy.

Several approaches have been described to prepare bi-specific antibodies in which specific antibody light chains or fragment pair with specific antibody heavy chains or fragments. A review of various approaches to address this problem can be found in Klein et al., (2012) mAbs 4:6, 1-11. International Patent Application No. PCT/EP2011/056388 (WO 2011/131746) describes an in vitro method for generating a heterodimeric protein in which asymmetrical mutations are introduced into the CH3 regions of two monospecific starting proteins in order to drive directional “Fab-arm” or “half-molecule” exchange between two monospecific IgG4- or IgG4-like antibodies upon incubation under reducing conditions.

Schaefer et al. (Roche Diagnostics GmbH), describe a method to assemble two heavy and two light chains, derived from two existing antibodies, into human bivalent bi-specific IgG antibodies without use of artificial linkers (PNAS (2011) 108(27): 11187-11192). The method involves exchanging heavy chain and light chain domains within the antigen-binding fragment (Fab) of one half of the bi-specific antibody.

Strop et al. (Rinat-Pfizer Inc.), describe a method of producing stable bi-specific antibodies by expressing and purifying two antibodies of interest separately, and then mixing them together under specified redox conditions (J. Mol. Biol. (2012) 420:204-19).

Zhu et al. (Genentech) have engineered mutations in the VL/VH interface of a diabody construct consisting of variant domain antibody fragments completely devoid of constant domains, and generated a heterodimeric diabody (Protein Science (1997) 6:781-788). Similarly, Igawa et al. (Chugai) have also engineered mutations in the VL/VH interface of a single-chain diabody to promote selective expression and inhibit conformational isomerization of the diabody (Protein Engineering, Design & Selection (2010) 23:667-677).

US Patent Publication No. 2009/0182127 (Novo Nordisk, Inc.) describes the generation of bi-specific antibodies by modifying amino acid residues at the Fc interface and at the CH1:CL interface of light-heavy chain pairs that reduce the ability of the light chain of one pair to interact with the heavy chain of the other pair.

SUMMARY

Described herein is an isolated antigen binding polypeptide construct comprising at least a first heterodimer and a second heterodimer, the first heterodimer comprising a first immunoglobulin heavy chain polypeptide sequence (H1), and a first immunoglobulin light chain polypeptide sequence (L1); and the second heterodimer comprising a second immunoglobulin heavy chain polypeptide sequence (H2), and a second immunoglobulin light chain polypeptide sequence (L2), wherein at least one of the H1 or L1 sequences of the first heterodimer is distinct from the corresponding H2 or L2 sequence of the second heterodimer, and wherein H1 and H2 each comprise at least a heavy chain variable domain (V_(H) domain) and a heavy chain constant domain (C_(H1) domain); L1 and L2 each comprise at least a light chain variable domain (V_(L) domain) and a light chain constant domain (C_(L) domain); and at least one of H1, H2, L1 and L2 comprises at least one amino acid modification of at least one constant domain and/or at least one variable domain, wherein H1 preferentially pairs with L1 as compared to L2 and H2 preferentially pairs with L2 as compared to L1.

In some aspects, the construct further comprises a heterodimeric Fc, the Fc comprising at least two CH3 sequences, wherein the Fc is coupled, with or without one or more linkers, to the first heterodimer and the second heterodimer, wherein the dimerized CH3 sequences have a melting temperature (Tm) of about 68° C. or higher as measured by differential scanning calorimetry (DSC), and wherein the construct is bispecific.

In some aspects, the at least one amino acid modification of is selected from at least one amino acid modification shown in the Tables or Examples.

In some aspects, H1 pairs preferentially with L1 as compared to L2, and H2 pairs preferentially with L2 as compared to L1, when H1, H2, L1 and L2 are co-expressed in a cell or a mammalian cell, or when H1, H2, L1 and L2 are co-expressed in a cell-free expression system, or when H1, H2, L1 and L2 are co-produced, or when H1, H2, L1 and L2 are co-produced via a redox production system.

In some aspects, at least one of H1, H2, L1 and L2 comprises at least one amino acid modification of a V_(H) and/or V_(L) domain and at least one amino acid modification of a C_(H1) and/or C_(L) domain such that H1 pairs preferentially with L1 as compared to L2, and/or H2 pairs preferentially with L2 as compared to L1.

In some aspects, if H1 comprises at least one amino acid modification in the C_(H1) domain, then at least one of L1 and L2 comprise at least one amino acid modification in the C_(L) domain; and/or if H1 comprises at least one amino acid modification in the V_(H) domain, then at least one of L1 and L2 comprise at least one amino acid modification in the V_(L) domain.

In some aspects, H1, L1, H2, and/or L2 comprises at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid mutations. In some aspects, at least one of H1, H2, L1 and L2 comprises at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid modifications of at least one constant domain and/or at least one variable domain.

In some aspects, when both L1 and L2 are co-expressed with at least one of H1 and H2, the relative pairing of the at least one of H1-L1 and H2-L2 heterodimer pair to that of the respective corresponding H1-L2 or H2-L1 heterodimer pair is greater than 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99%, and wherein the relative pairing of the modified H1-L1 or H2-L2 heterodimer pair is greater than the respective relative pairing observed in the corresponding H1-L1 or H2-L2 heterodimer pair without the at least one amino acid modification.

In some aspects, the thermal stability as measured by the melting temperature (Tm) as measured by DSC of at least one of the first and second heterodimers is within about 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10° C. of the Tm of the corresponding heterodimer without the at least one amino acid modification. In some aspects, the thermal stability as measured by the melting temperature (Tm) as measured by DSC of each heterodimer comprising at least one amino acid modification is within about 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10° C. of the Tm of the corresponding heterodimer without the at least one amino acid modification.

In some aspects, the affinity of each heterodimer for the antigen to which it binds is within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 25, 30, 35, 40, 45, or 50-fold of the affinity of the respective unmodified heterodimer for the same antigen as measured by surface plasmon resonance (SPR) or FACS.

In some aspects, at least one of H1 and L1 comprises at least one domain comprising at least one amino acid modification resulting in greater steric complementarity of amino acids when H1 pairs with L1 as compared to L2. In some aspects, at least one of H2 and L2 comprises at least one domain comprising at least one amino acid modification resulting in greater steric complementarity of amino acids when H2 pairs with L2 as compared to L1. In some aspects, at least one of H1 and L1 comprises at least one domain comprising at least one amino acid modification resulting in greater electrostatic complementarity between charged amino acids when H1 pairs with L1 as compared to L2. In some aspects, at least one of H2 and L2 comprises at least one domain comprising at least one amino acid modification resulting in greater electrostatic complementarity between charged amino acids when H2 pairs with L2 as compared to L1.

In some aspects, the at least one amino acid modification of is a set of mutations shown in at least one of the Tables or Examples. In some aspects, the at least one modification is not H1-Q39E, L1-Q38K, H2-Q39K, and L2-Q38E. In some aspects, the at least one modification is not H1-Q39E, L1-Q38E, H2-Q39K, and L2-Q38K.

In some aspects, the construct further comprises an Fc comprising at least two CH3 sequences, wherein the Fc is coupled, with or without one or more linkers, to the first heterodimer and the second heterodimer.

In some aspects, the Fc is a human Fc, a human IgG1 Fc, a human IgA Fc, a human IgG Fc, a human IgD Fc, a human IgE Fc, a human IgM Fc, a human IgG2 Fc, a human IgG3 Fc, or a human IgG4 Fc. In some aspects, the Fc is a heterodimeric Fc. In some aspects, the Fc comprises one or more modifications in at least one of the CH3 sequences. In some aspects, the dimerized CH3 sequences have a melting temperature (Tm) as measured by DSC of about 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 77.5, 78, 79, 80, 81, 82, 83, 84, or 85° C. or higher. In some aspects, the Fc is a heterodimer formed with a purity greater than about 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% when produced; or wherein the Fc is a heterodimer formed with a purity greater than about 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% when expressed or when expressed via a single cell. In some aspects, the Fc comprises one or more modifications in at least one of the CH3 sequences that promote the formation of a heterodimeric Fc with stability comparable to a wild-type homodimeric Fc. In some aspects, the Fc further comprises at least one CH2 sequence. In some aspects, the CH2 sequence(s) of the Fc comprises one or more modifications. In some aspects, the Fc comprises one or more modifications to promote selective binding of Fc-gamma receptors.

In some aspects, the Fc is coupled to the heterodimers by one or more linkers, or wherein the Fc is coupled to H1 and H2 by one or more linkers. In some aspects, the one or more linkers are one or more polypeptide linkers. In some aspects, the one or more linkers comprises one or more antibody hinge regions. In some aspects, the one or more linkers comprises one or more IgG1 hinge regions. In some aspects, the one or more linkers comprises one or more modifications. In some aspects, the one or more modifications to the one or more linkers promote selective binding of Fc-gamma receptors.

In some aspects, the at least one amino acid modification is at least one amino acid mutation or wherein the at least one amino acid modification is at least one amino acid substitution.

In some aspects, the sequences of each of H1, H2, L1, and L2 are derived from human sequences.

In some aspects, the construct is multispecific or bispecific. In some aspects, the construct is multivalent or bivalent.

Also described herein is an isolated polynucleotide or set of isolated polynucleotides comprising at least one sequence that encodes a construct described herein. In some aspects, the polynucleotide or set of polynucleotides is cDNA.

Also described herein is a vector or set of vectors comprising one or more of the polynucleotides or sets of polynucleotides described herein. In some aspects, the vector or set of vectors is selected from the group consisting of a plasmid, a multi-cistronic vector, a viral vector, a non-episomal mammalian vector, an expression vector, and a recombinant expression vector.

Also described herein is an isolated cell comprising a polynucleotide or set of polynucleotides described herein or a vector or set of vectors described herein. In some aspects, the cell is a hybridoma, a Chinese Hamster Ovary (CHO) cell, or a HEK293 cell.

Also described herein is a pharmaceutical composition comprising a construct described herein and a pharmaceutically acceptable carrier. In some aspects, the composition further comprises one or more substances selected from the group consisting of a buffer, an antioxidant, a low molecular weight molecule, a drug, a protein, an amino acid, a carbohydrate, a lipid, a chelating agent, a stabilizer, and an excipient.

Also described herein is a use of a construct described herein or a pharmaceutical composition described herein for the treatment of a disease or disorder or cancer or vascular disease in a subject or in the manufacture of a medicine.

Also described herein is a method of treatment of a subject having a disease or disorder or cancer or vascular disease comprising administering to the subject a construct described herein or a composition described herein.

Also described herein is a method of inhibiting, reducing or blocking a signal within or to a cell, comprising contacting the cell with a construct described herein or a composition described herein.

Also described herein is a method of obtaining a construct described herein from a host cell culture, the method comprising the steps of: (a) obtaining a host cell culture comprising at least one host cell comprising one or more nucleic acid sequences encoding the construct; and (b) recovering the construct from the host cell culture.

Also described herein is a method of obtaining a construct described herein comprising the steps of: (a) obtaining H1, L1, H2, and L2; (b) allowing H1 to pair preferentially with L1 as compared to L2 and H2 to pair preferentially with L2 as compared to L1; and (c) obtaining the construct.

Also described herein is a method of preparing a construct described herein comprising: obtaining a polynucleotide or set of polynucleotides encoding at least one construct; determining the optimal ratios of each of the polynucleotide or set of polynucleotides for introduction into at least one host cell, wherein the optimal ratios are determined by assessing the amount of H1-L1 and H2-L2 heterodimer pairs formed upon expression of H1, L1, H2, and L2 as compared to mispaired H1-L2 and H2-L1 heterodimer pairs formed upon expression of H1, L1, H2, and L2; selecting a preferred optimal ratio, wherein transfection of at least one host cell with the preferred optimal ratio of the polynucleotide or set of polynucleotides results in expression of the construct; transfecting the at least one host cell with the optimal ratio of the polynucleotide or set of polynucleotides; and culturing the at least one host cell to express the construct.

In some aspects, selecting the optimal ratio is assessed by transfection in a transient transfection system. In some aspects, transfection of the at least one host cell with the preferred optimal ratio of the polynucleotide or set of polynucleotides results in optimal expression of the construct. In some aspects, the construct comprises an Fc comprising at least two CH3 sequences, wherein the Fc is coupled, with or without one or more linkers, to the first heterodimer and the second heterodimer. In some aspects, the Fc is a heterodimer, optionally comprising one or more amino acid modifications.

Also described herein is a computer-readable storage medium storing a dataset comprising data representing complementary mutations in a first heterodimer comprising a first immunoglobulin heavy chain polypeptide sequence (H1) and a first immunoglobulin light chain polypeptide sequence (L1); and a second heterodimer comprising a second immunoglobulin heavy chain polypeptide sequence (H2) and a second immunoglobulin light chain polypeptide sequence (L2), wherein H1 and H2 each comprise at least a heavy chain variable domain (V_(H) domain) and a heavy chain constant domain (C_(H1) domain); wherein L1 and L2 each comprise at least a light chain variable domain (V_(L) domain) and a light chain constant domain (C_(L) domain), and wherein the dataset of complementary mutations comprises data representing those mutations listed in the Tables or Examples or a subset of those mutations; and computer executable code for determining the likelihood that H1 will pair preferentially with L1 as compared to L2 and/or H2 will pair preferentially with L2 as compared to L1.

Also described herein is a computer implemented method for determining preferential pairing, comprising: obtaining a dataset comprising data representing complementary mutations in a first heterodimer comprising a first immunoglobulin heavy chain polypeptide sequence (H1) and a first immunoglobulin light chain polypeptide sequence (L1); and a second heterodimer comprising a second immunoglobulin heavy chain polypeptide sequence (H2) and a second immunoglobulin light chain polypeptide sequence (L2), wherein H1 and H2 each comprise at least a heavy chain variable domain (V_(H) domain) and a heavy chain constant domain (C_(H1) domain); wherein L1 and L2 each comprise at least a light chain variable domain (V_(L) domain) and a light chain constant domain (C_(L) domain), and wherein the dataset of complementary mutations comprises data representing those mutations listed in the Tables or Examples or a subset of those mutations; and determining, by a computer processor, the likelihood that H1 will pair preferentially with L1 as compared to L2 and/or H2 will pair preferentially with L2 as compared to L1. In some aspects, the method further comprises producing a construct described herein.

Also described herein is a method of producing a bi-specific antigen binding polypeptide construct, said bi-specific construct comprising a first heterodimer comprising a first immunoglobulin heavy chain polypeptide sequence (H1), and a first immunoglobulin light chain polypeptide sequence (L1) from a first mono-specific antigen binding polypeptide; and a second heterodimer comprising a second immunoglobulin heavy chain polypeptide sequence (H2), and a second immunoglobulin light chain polypeptide sequence (L2) from a second mono-specific antigen binding polypeptide, wherein H1 and H2 each comprise at least a heavy chain variable domain (V_(H) domain) and a heavy chain constant domain (C_(H1) domain); wherein L1 and L2 each comprise at least a light chain variable domain (V_(L) domain) and a light chain constant domain (C_(L) domain), the method comprising: obtaining a dataset comprising data representing a set of amino acid modifications within H1, H2, L1 and L2 such that upon introduction of a subset of the modifications into H1, H2, L1 and/or L2, H1 pairs preferentially with L1 as compared to L2 and H2 pairs preferentially with L2 as compared to L1 in a test system; introducing a subset of one or more modifications from the dataset into the first heterodimer and/or the second heterodimer; and co-expressing the first heterodimer and the second heterodimer in at least one host cell to produce an expression product comprising the bi-specific construct.

In some aspects, the method further comprises determining the amount of the bi-specific construct in the expression product relative to other polypeptide products. In some aspects, the bi-specific construct is produced with a purity of greater than 70% compared to the other polypeptide products. In some aspects, the dataset is a dataset described herein. In some aspects, the method further comprises the step of adding additional amino acid modifications to at least one of H1, H2, L1, or L2 to increase the purity of the bi-specific construct compared to the other polypeptide products. In some aspects, the construct comprises an Fc comprising at least two C_(H3) sequences, wherein the Fc is coupled, with or without one or more linkers, to the first heterodimer and the second heterodimer. In some aspects, the Fc is a heterodimer, optionally comprising one or more amino acid modifications. In some aspects, the antigen binding polypeptide is an antibody, a Fab, or a scFv.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 provides a Table showing the preferential pairing of heterodimers, from antigen-binding constructs described herein, in LCCA design sets where amino acid modifications have been made to the V_(H) or V_(L) domains.

FIG. 2 provides a Table showing the preferential pairing of heterodimers in LCCA design sets where amino acid modifications have been made to the C_(H1) or C_(L) domains.

FIG. 3 provides a Table showing the thermal stability and affinity for antigen for selected preferentially paired or mispaired heterodimers from antigen-binding constructs described herein.

FIG. 4 provides thermal unfolding curves for selected heterodimers.

FIG. 5 provides size exclusion chromatography profiles for selected heterodimers.

FIGS. 6A-E depicts D3H44 heavy chain and light chain amino acid sequences aligned against canonical human germline sequences for Variable, Constant and J-region segments. (Notations in figs: * sequence identity, # Interface hotspots (specificity drivers), + mutated residues tested in designs). FIG. 6A depicts Human VH germline subgroups (one representative sequence is displayed for each family). Sequence identity based on an alignment of D3H44 against VH3 and IGHJ3*02. FIG. 6A discloses SEQ ID NOS 20-34, respectively, in order of appearance. FIG. 6B depicts Human kappa VL germline subgroups (one representative sequence is displayed from each family). Sequence identity based on an alignment of D3H44 against VKI and IGKJ1*01. FIG. 6B discloses SEQ ID NOS 35-47, respectively, in order of appearance. FIG. 6C depicts Human lambda VL germline subgroups (one representative sequence is displayed from each family). Sequence identity based on an alignment of D3H44 against VL1 and IGLJ1*01. FIG. 6C discloses SEQ ID NOS 35, 48-57, 42, and 58-64, respectively, in order of appearance. FIG. 6D depicts human CH1 allele sequences. FIG. 6D discloses SEQ ID NOS 65-74, respectively, in order of appearance. FIG. 6E depicts Human kappa and lambda allele sequences. FIG. 6E discloses SEQ ID NOS 75-80, 75, and 81-85, respectively, in order of appearance.

FIG. 7 depicts a flow chart outlining a strategy for designing a bi-specific antibody.

FIG. 8 illustrates a high level schematic overview of the engineering requirements for forming a bispecific Mab (monoclonal antibody), and the assay requirements needed to quantify heavy chain light chain pairs. The design goal of engineering a bispecific Mab with high purity (i.e., little or no mispaired H-L associations) can be achieved by rationally engineering (via the introduction of specific amino acid mutations) the preferential pairing of two unique heavy chains for their unique cognate light chains. This process is shown schematically; here H1 has been engineered to preferentially pair with L1 and not L2. Likewise, H2 has been engineered to preferentially pair with L2 and not L1. The experimental screening of bispecific Mab designs requires an assay capable of simultaneously quantifying H1-L1:H1-L2 and H2-L2:H2-L1. These assay requirements can be simplified by assuming that each bispecific Fab arm can be independently engineered. In this case, the assay would only need to quantify H1-L1:H1-L2 or H2-L2:H2-L1, and not both simultaneously.

FIG. 9 provides a schematic depicting how heavy chains and light chains are tagged and preferential pairing is determined. In this schematic, the circle represents a cell in which 3 constructs are transfected. The expression products are secreted from the cell and the supernatant (SPNT) is flowed over a detection device, in this case an SPR chip. Based on the detection level of the two different tags fused to the two light chains competing for heavy chain pairing, a quantitative estimate of the preferential pairing of the heavy chain to the two light chains can be estimated.

FIG. 10 depicts the heavy chain associated products expected when each of two full-length heavy chains is independently co-expressed with two different light chains. Preferential pairing is assessed using MCA.

FIG. 11 depicts an exemplary set of H1, L1, H2, L2 chains which have been designed such that H1 preferentially pairs with L1 over L2 and H2 preferentially pairs with L2 over L. A cartoon representation of the 3D crystal structure of the variable region heavy and light chain interface is presented. The mutations introduced at the interface achieve electrostatic and steric complementarity in the two set of variable region interface respectively for the preferentially forming obligate pair. On the other hand, there is unfavorable steric and electrostatic mismatch in the incorrect pair that would result in reduced pairing propensity for the mismatched pair as well as reduced stability.

FIG. 12 depicts LC/MS spectra resulting when H1, L1 and L2 are co-expressed (left panel) and when H2, L1, and L2 are co-expressed (right panel), based on the exemplary set of H1, L1, H2, L2 chains shown in FIG. 11.

FIGS. 13 A-C depicts assessment of the biophysical properties of the H1-L1 and H2-L2 pairs based on the design shown in FIG. 11. FIG. 13A shows non-reducing SDS-PAGE analysis of the H1-L1 and H2-L2 pairs before and after protein A purification, and the yield of product is shown at the bottom of the gels; FIG. 13B shows the DSC thermograms of the H1-L1 and H2-L2 paired products; and FIG. 13C shows UPLC-SEC (H2-L2) profile for the H2-L2 pair.

FIG. 14 depicts the heavy chain associated products expected when two different light chains are co-expressed with two different heavy chains in a cell. Preferential pairing is assessed using an SMCA (monoclonal antibody competition assay).

FIG. 15 depicts the LC/MS spectra for the bispecific antigen-binding construct (H1-L1_H2-L2) based on design shown in FIG. 11.

FIG. 16A Left panel depicts an assessment of the purity of the bispecific antigen-binding construct derived from design set as depicted in FIG. 11. FIG. 16A shows a Coomassie stained non-reducing SDS-PAGE of the SMCA variant, along with a control variant composed of Her2 binding Mab with a heterodimeric Fc. The purity of the SMCA variant, after protein A (ProtA) purification, is high and is qualitatively equivalent to the control. The estimated post-ProtA yield of the SMCA variant is 20 mg/L and is comparable to the control yield of 40 mg/ml. Right panel (FIG. 16B) depicts the SEC profile of the aforementioned bispecific construct. The main peak (>90% total peak area) observed runs at a molecular weight of ˜150KDa and is composed of Mab monomers. The observed minor peak runs at ˜75KDa and consists of half-antibodies. No significant higher molecular weight peaks (potentially indicative of aggregate species) are observed.

FIG. 17A Upper panel depicts SPR data showing monospecific binding for TF or Her2 antigen by the bispecific construct based on design set depicted in FIG. 11 (Note: This SMCA design utilizes the same H1-L1 and H2-L2 MCA designs as depicted in FIG. 11). Lower panel depicts SPR data showing simultaneous bispecific binding of TF and Her2 antigens by the bispecific antigen-binding construct.

FIG. 17B Upper panel depicts SPR data showing monospecific binding for tissue factor (TF) or Her2 antigen by the bispecific construct based on design set depicted in FIG. 11 (Note: This SMCA design utilizes the same H1-L1 and H2-L2 MCA designs as depicted in FIG. 11). Lower panel depicts SPR data showing simultaneous bispecific binding of TF and Her2 antigens by the bispecific antigen binding construct.

FIG. 18 depicts a flowchart for identifying critical interface residues and for computational modeling of designs with preferential heavy-light chain pairing.

FIG. 19 presents a schematic depicting a method of preparing a bi-specific antibody using the library of obligate mutation pairs provided in this invention.

DETAILED DESCRIPTION

Provided herein are antigen binding polypeptide constructs (also referred to as heterodimer pairs) which can comprise a first heterodimer and a second heterodimer wherein each heterodimer comprises an immunoglobulin heavy chain or fragment thereof and an immunoglobulin light chain. At least one of the heterodimers can comprise one or more amino acid modifications in the immunoglobulin heavy chain constant domain 1 (CH1) and one or more amino acid modifications in the immunoglobulin light chain constant domain (CL); one or more amino acid modifications in the immunoglobulin heavy chain variable domain (VH) and one or more amino acid modifications in the immunoglobulin light chain variable domain (VL); or a combination of the preceding amino acid modifications to both the constant and variable domains of the heavy and light chains. The amino acids that are modified are typically part of the interface between the light chain and heavy chain and are modified to create preferential pairing between each heavy chain and the desired light chain such that the heavy chain of the first heterodimer preferentially pairs with one of the light chains rather than the other. Likewise, the heavy chain of the second heterodimer can preferentially pair with the second light chain rather than first.

As noted above, specific combinations of the amino acid modifications described herein promote preferential pairing of heavy chains with specific light chains, thus enabling bi-specific monoclonal antibody (Mab) expression to occur with negligible or limited mispairing, and minimizing the need to purify the desired heterodimers from undesired, or mispaired products. The heterodimers can exhibit comparable thermal stability to heterodimers that do not include the amino acid modifications, and can also demonstrate binding affinity for antigen that is comparable to heterodimers that do not include the amino acid modifications.

The designs of the first and second heterodimers, can be used to create bi-specific antibodies targeting two different therapeutic targets or targeting two distinct epitopes (overlapping or non-overlapping) within the same antigen.

The invention further provides methods of preparing the heterodimer pairs according to the invention.

Definitions

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which the claimed subject matter belongs. In the event that there are a plurality of definitions for terms herein, those in this section prevail. Where reference is made to a URL or other such identifier or address, it is understood that such identifiers can change and particular information on the internet can come and go, but equivalent information can be found by searching the internet. Reference thereto evidences the availability and public dissemination of such information.

It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of any subject matter claimed. In this application, the use of the singular includes the plural unless specifically stated otherwise.

In the present description, any concentration range, percentage range, ratio range, or integer range is to be understood to include the value of any integer within the recited range and, when appropriate, fractions thereof (such as one tenth and one hundredth of an integer), unless otherwise indicated. As used herein, “about” means±10% of the indicated range, value, sequence, or structure, unless otherwise indicated. It should be understood that the terms “a” and “an” as used herein refer to “one or more” of the enumerated components unless otherwise indicated or dictated by its context. The use of the alternative (e.g., “or”) should be understood to mean either one, both, or any combination thereof of the alternatives. As used herein, the terms “include” and “comprise” are used synonymously. In addition, it should be understood that the individual single chain polypeptides or immunoglobulin constructs derived from various combinations of the structures and substituents described herein are disclosed by the present application to the same extent as if each single chain polypeptide or heterodimer were set forth individually. Thus, selection of particular components to form individual single chain polypeptides or heterodimers is within the scope of the present disclosure

The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described. All documents, or portions of documents, cited in the application including, but not limited to, patents, patent applications, articles, books, manuals, and treatises are hereby expressly incorporated by reference in their entirety for any purpose.

It is to be understood that the methods and compositions described herein are not limited to the particular methodology, protocols, cell lines, constructs, and reagents described herein and as such may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the methods and compositions described herein, which will be limited only by the appended claims.

All publications and patents mentioned herein are incorporated herein by reference in their entirety for the purpose of describing and disclosing, for example, the constructs and methodologies that are described in the publications, which might be used in connection with the methods, compositions and compounds described herein. The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the inventors described herein are not entitled to antedate such disclosure by virtue of prior invention or for any other reason.

In the present application, amino acid names and atom names (e.g. N, O, C, etc.) are used as defined by the Protein DataBank (PDB) (www.pdb.org), which is based on the IUPAC nomenclature (IUPAC Nomenclature and Symbolism for Amino Acids and Peptides (residue names, atom names etc.), Eur. J. Biochem., 138, 9-37 (1984) together with their corrections in Eur. J. Biochem., 152, 1 (1985). The term “amino acid residue” is primarily intended to indicate an amino acid residue contained in the group consisting of the 20 naturally occurring amino acids, i.e. alanine (Ala or A), cysteine (Cys or C), aspartic acid (Asp or D), glutamic acid (Glu or E), phenylalanine (Phe or F), glycine (Gly or G), histidine (His or H), isoleucine (Ile or I), lysine (Lys or K), leucine (Leu or L), methionine (Met or M), asparagine (Asn or N), proline (Pro or P), glutamine (Gln or Q), arginine (Arg or R), serine (Ser or S), threonine (Thr or T), valine (Val or V), tryptophan (Trp or W), and tyrosine (Tyr or Y) residues.

The terms “polypeptide,” “peptide” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues. That is, a description directed to a polypeptide applies equally to a description of a peptide and a description of a protein, and vice versa. The terms apply to naturally occurring amino acid polymers as well as amino acid polymers in which one or more amino acid residues is a non-naturally encoded amino acid. As used herein, the terms encompass amino acid chains of any length, including full length proteins, wherein the amino acid residues are linked by covalent peptide bonds.

The term “nucleotide sequence” or “nucleic acid sequence” is intended to indicate a consecutive stretch of two or more nucleotide molecules. The nucleotide sequence may be of genomic, cDNA, RNA, semisynthetic or synthetic origin, or any combination thereof.

“Cell”, “host cell”, “cell line” and “cell culture” are used interchangeably herein and all such terms should be understood to include progeny resulting from growth or culturing of a cell. “Transformation” and “transfection” are used interchangeably to refer to the process of introducing a nucleic acid sequence into a cell.

The term “amino acid” refers to naturally occurring and non-naturally occurring amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally encoded amino acids are the 20 common amino acids (alanine, arginine, asparagine, aspartic acid, cysteine, glutamine, glutamic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, praline, serine, threonine, tryptophan, tyrosine, and valine) and pyrrolysine and selenocysteine. Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, such as, homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified R groups (such as, norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. Reference to an amino acid includes, for example, naturally occurring proteogenic L-amino acids; D-amino acids, chemically modified amino acids such as amino acid variants and derivatives; naturally occurring non-proteogenic amino acids such as alanine, ornithine, etc.; and chemically synthesized compounds having properties known in the art to be characteristic of amino acids. Examples of non-naturally occurring amino acids include, but are not limited to, □-methyl amino acids (e.g. methyl alanine), D-amino acids, histidine-like amino acids (e.g., 2-amino-histidine, hydroxy-histidine, homohistidine), amino acids having an extra methylene in the side chain (“homo” amino acids), and amino acids in which a carboxylic acid functional group in the side chain is replaced with a sulfonic acid group (e.g., cysteic acid). The incorporation of non-natural amino acids, including synthetic non-native amino acids, substituted amino acids, or one or more D-amino acids into the proteins of the present invention may be advantageous in a number of different ways. D-amino acid-containing peptides, etc., exhibit increased stability in vitro or in vivo compared to L-amino acid-containing counterparts. Thus, the construction of peptides, etc., incorporating D-amino acids can be particularly useful when greater intracellular stability is desired or required. More specifically, D-peptides, etc., are resistant to endogenous peptidases and proteases, thereby providing improved bioavailability of the molecule, and prolonged lifetimes in vivo when such properties are desirable. Additionally, D-peptides, etc., cannot be processed efficiently for major histocompatibility complex class II-restricted presentation to T helper cells, and are therefore, less likely to induce humoral immune responses in the whole organism.

Amino acids are referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes.

“Conservatively modified variants” applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, “conservatively modified variants” refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are “silent variations,” which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of ordinary skill in the art will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is implicit in each described sequence.

As to amino acid sequences, one of ordinary skill in the art will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the deletion of an amino acid, addition of an amino acid, or substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are known to those of ordinary skill in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the invention.

Conservative substitution tables providing functionally similar amino acids are known to those of ordinary skill in the art. The following eight groups each contain amino acids that are conservative substitutions for one another:

1) Alanine (A), Glycine (G);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Arginine (R), Lysine (K);

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V);

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W);

7) Serine (S), Threonine (T); and

8) Cysteine (C), Methionine (M)

(see, e.g., Creighton, Proteins: Structures and Molecular Properties (W H Freeman & Co.; 2nd edition (December 1993).

The terms “identical” or percent “identity,” in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same. Sequences are “substantially identical” if they have a percentage of amino acid residues or nucleotides that are the same (i.e., about 50% identity, about 55% identity, 60% identity, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95% or about 99% identity over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms (or other algorithms available to persons of ordinary skill in the art) or by manual alignment and visual inspection. This definition also refers to the complement of a test sequence. The identity can exist over a region that is at least about 50 amino acids or nucleotides in length, or over a region that is 75-100 amino acids or nucleotides in length, or, where not specified, across the entire sequence of a polynucleotide or polypeptide. A polynucleotide encoding a polypeptide of the present invention, including homologs from species other than human, may be obtained by a process comprising the steps of screening a library under stringent hybridization conditions with a labeled probe having a polynucleotide sequence of the invention or a fragment thereof, and isolating full-length cDNA and genomic clones containing said polynucleotide sequence. Such hybridization techniques are well known to the skilled artisan.

A derivative, or a variant of a polypeptide is said to share “homology” or be “homologous” with the peptide if the amino acid sequences of the derivative or variant has at least 50% identity with a 100 amino acid sequence from the original peptide. In certain embodiments, the derivative or variant is at least 75% the same as that of either the peptide or a fragment of the peptide having the same number of amino acid residues as the derivative. In certain embodiments, the derivative or variant is at least 85% the same as that of either the peptide or a fragment of the peptide having the same number of amino acid residues as the derivative. In certain embodiments, the amino acid sequence of the derivative is at least 90% the same as the peptide or a fragment of the peptide having the same number of amino acid residues as the derivative. In some embodiments, the amino acid sequence of the derivative is at least 95% the same as the peptide or a fragment of the peptide having the same number of amino acid residues as the derivative. In certain embodiments, the derivative or variant is at least 99% the same as that of either the peptide or a fragment of the peptide having the same number of amino acid residues as the derivative.

As used herein, an “isolated” polypeptide or construct means a construct or polypeptide that has been identified and separated and/or recovered from a component of its natural cell culture environment. Contaminant components of its natural environment are materials that would typically interfere with diagnostic or therapeutic uses for the heteromultimer, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes.

In certain embodiments, as used herein, “isolated” antigen-binding constructs described herein comprise heterodimer pairs or “isolated” heterodimer pairs that comprise a heterodimer or heterodimer pair that has been identified and separated and/or recovered from a component of its natural cell culture environment. Contaminant components of its natural environment are materials that would interfere with diagnostic or therapeutic uses for the heterodimer or antigen-binding construct, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes.

The heterodimers and antigen-binding constructs and heterodimer pairs are generally purified to substantial homogeneity. The phrases “substantially homogeneous”, “substantially homogeneous form” and “substantial homogeneity” are used to indicate that the product is substantially devoid of by-products originated from undesired polypeptide combinations (e.g. homodimers). Expressed in terms of purity, substantial homogeneity means that the amount of by-products does not exceed 10%, and preferably is below 5%, more preferably below 1%, most preferably below 0.5%, wherein the percentages are by weight.

The phrase “selectively (or specifically) hybridizes to” refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent hybridization conditions when that sequence is present in a complex mixture (including but not limited to, total cellular or library DNA or RNA).

Terms understood by those in the art of antibody technology are each given the meaning acquired in the art, unless expressly defined differently herein. Antibodies are known to have variable regions, a hinge region, and constant domains. Immunoglobulin structure and function are reviewed, for example, in Harlow et al, Eds., Antibodies: A Laboratory Manual, Chapter 14 (Cold Spring Harbor Laboratory, Cold Spring Harbor, 1988).

As used herein, the terms “antibody” and “immunoglobulin” or “antigen binding polypeptide” are used interchangeably. An “antigen binding polypeptide” refers to a polypeptide substantially encoded by an immunoglobulin gene or immunoglobulin genes, or one or more fragments thereof, which specifically bind an analyte (antigen). The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as the myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin isotypes, IgG, IgM, IgA, IgD, and IgE, respectively. Further, the antibody can belong to one of a number of subtypes, for instance, the IgG can belong to the IgG1, IgG2, IgG3, or IgG4 subclasses.

An exemplary immunoglobulin (antibody) structural unit is composed of two pairs of polypeptide chains, each pair having one “light” (about 25 kD) and one “heavy” chain (about 50-70 kD). The term “light chain” includes a full-length light chain and fragments thereof having sufficient variable region sequence to confer binding specificity. A full-length light chain includes a variable region domain, VL, and a constant region domain, CL. The variable region domain of the light chain is at the amino-terminus of the polypeptide. Light chains include kappa chains and lambda chains. The term “heavy chain” includes a full-length heavy chain and fragments thereof having sufficient variable region sequence to confer binding specificity. A full-length heavy chain includes a variable region domain, VH, and three constant region domains, CH1, CH2, and CH3. The VH domain is at the amino-terminus of the polypeptide, and the CH domains are at the carboxyl-terminus, with the CH3 being closest to the carboxy-terminus of the polypeptide. Heavy chains can be of any isotype, including IgG (including IgG1, IgG2, IgG3 and IgG4 subclasses), IgA (including IgA1 and IgA2 subclasses), IgM and IgE. The term “variable region” or “variable domain” refers to a portion of the light and/or heavy chains of an antibody generally responsible for antigen recognition, typically including approximately the amino-terminal 120 to 130 amino acids in the heavy chain (VH) and about 100 to 110 amino terminal amino acids in the light chain (VL). A “complementarity determining region” or “CDR” is an amino acid sequence that contributes to antigen binding specificity and affinity. “Framework” regions (FR) can aid in maintaining the proper conformation of the CDRs to promote binding between the antigen binding region and an antigen. Structurally, framework regions can be located in antibodies between CDRs. The variable regions typically exhibit the same general structure of relatively conserved framework regions (FR) joined by three hyper variable regions, CDRs. The CDRs from the two chains of each pair typically are aligned by the framework regions, which can enable binding to a specific epitope. From N-terminal to C-terminal, both light and heavy chain variable regions typically comprise the domains FR1, CDR1, FR2, CDR2, FR3, CDR3, and FR4. The assignment of amino acids to each domain is typically in accordance with the definitions of Kabat Sequences of Proteins of Immunological Interest (National Institutes of Health, Bethesda, Md. (1987 and 1991)), unless stated otherwise. In certain embodiments, the immunoglobulin constructs comprise at least one immunoglobulin domain from IgG, IgM, IgA, IgD, or IgE connected to a therapeutic polypeptide. In some embodiments, the immunoglobulin domain comprised in an immunoglobulin construct provided herein, is from an immunoglobulin based construct such as a diabody, or a nanobody. In certain embodiments, the immunoglobulin constructs described herein comprise at least one immunoglobulin domain from a heavy chain antibody such as a camelid antibody. In certain embodiments, the immunoglobulin constructs provided herein comprise at least one immunoglobulin domain from a mammalian antibody such as a bovine antibody, a human antibody, a camelid antibody, a mouse antibody or any chimeric antibody.

A “bi-specific,” “dual-specific” or “bifunctional” antigen binding protein or antibody is a hybrid antigen binding protein having two different antigen binding sites. Bispecific antigen binding proteins and antibodies are a species of multispecific antigen binding protein antibody. The two binding sites of a bispecific antigen binding protein or antibody will bind to two different epitopes, which can reside on the same or different molecular targets. A “multispecific antigen binding protein” or “multispecific antibody” is one that targets more than one antigen or epitope. A “bivalent antigen binding protein” or “bivalent antibody” comprises two antigen binding sites. In some instances, the two binding sites have the same antigen specificities. Bivalent antigen binding proteins and bivalent antibodies can be bispecific, see, infra. A bivalent antibody other than a “multispecific” or “multifunctional” antibody, in certain embodiments, typically is understood to have each of its binding sites identical.

The term “preferential pairing” is used herein to describe the pairing pattern of a first polypeptide with a second polypeptide, e.g., an immunoglobulin heavy chain with an immunoglobulin light chain in the antigen-binding constructs and heterodimer pairs described herein. As such, “preferential pairing” refers to the preferred pairing of a first polypeptide with a second polypeptide when one or more additional, distinct polypeptides are present at the same time as the pairing occurs between the first and second polypeptide. Typically preferential pairing occurs as a result of the modification (e.g., amino acid modification) of one or both of the first and second polypeptide. Typically preferential pairing results in the paired first and second polypeptide being the most abundant dimer present after pairing occurs. It is known in the art that an immunoglobulin heavy chain (H1) will if co-expressed with two different immunoglobulin light chains (L1 and L2), statistically pair equally with both light chains, resulting in an approximate 50:50 mixture of H1 paired with L1 and H1 paired with L2. In this context, “preferential pairing” would occur between, for example, H1 and L1, if the amount of the H1-L1 heavy chain-light chain heterodimer was greater than the amount of the H1-L2 heterodimer when H1 is co-expressed with both L1 and L2. Thus, in this case, H1 preferentially pairs with L1 relative to L2.

Antibody heavy chains pair with antibody light chains and meet or contact one another at an “interface.” The “interface” includes one or more “contact” amino acid residues in a first polypeptide that interact with one or more “contact” amino acid residues of a second polypeptide. In one context, the term interface can be used to describe the interface of the dimerized CH3 domain of an Fc, where the Fc is preferably derived from an IgG antibody such as IgG1 and most preferably a human IgG1 antibody.

The antibody heavy chain that is to be associated with an antibody light chain typically meet or contact each other at an “interface.” The immunoglobulin light chain operatively associates with the immunoglobulin heavy chain via the “interface”. The “interface” comprises those one or more “contact” amino acid residues in the immunoglobulin heavy chain that interact with one or more “contact” amino acid residues in the interface of the immunoglobulin light chain. As used herein, the interface can comprise the VH and CH1 domains of the immunoglobulin heavy chain and the VL and CL domains of the immunoglobulin light chain. The “interface” can be derived from an IgG antibody and most preferably a human IgG1 antibody.

The term “amino acid modifications” as used herein includes, but is not limited to, amino acid mutations, insertions, deletions, substitutions, chemical modifications, physical modifications, and rearrangements.

Antigen Binding Constructs and Heterodimer Pairs

The antigen-binding constructs described herein can comprise a first heterodimer and a second heterodimer; each heterodimer obtained by pairing an immunoglobulin heavy chain with an immunoglobulin light chain. The structure and organization of the constant and variable domains of immunoglobulin heavy and light chains are well known in the art. Immunoglobulin heavy chains typically comprise one variable (VH) domain, and three constant domains, CH1, CH2, and CH3. Immunoglobulin light chains typically comprise one variable (VL) domain and one constant (CL) domain. Various modifications to these typical formats can be made.

The antigen-binding constructs and heterodimer pairs described herein can comprise a first heterodimer and a second heterodimer, each heterodimer comprising an immunoglobulin/antibody heavy chain or fragment thereof having at least a VH and CH1 domain, and an immunoglobulin/antibody light chain having a VL domain and a CL domain. In one embodiment, both heterodimers of the heterodimer pair and antigen-binding construct comprise a full-length immunoglobulin heavy chain. In another embodiment, both heterodimers of the heterodimer pair or antigen-binding construct comprise a fragment of the immunoglobulin heavy chain that includes at least a VH and a CH1 domain. In one embodiment, both heterodimers of the heterodimer pair comprise an amino terminal fragment of the immunoglobulin heavy chain that comprises at least a VH and a CH1 domain. In another embodiment, both heterodimers of the heterodimer pair comprise a carboxy terminal fragment of the immunoglobulin heavy chain that comprises at least a VH and a CH1 domain.

Each heterodimer of the heterodimer pair can bind specifically to an antigen or epitope. In one embodiment, the immunoglobulin heavy chain and the immunoglobulin light chain of each heterodimer is derived or engineered from a known therapeutic antibody. A therapeutic antibody is one that is effective in treating a disease or disorder in a mammal with or predisposed to the disease or disorder. Suitable therapeutic antibodies from which each heterodimer can be derived include, but are not limited to abagovomab, adalimumab, alemtuzumab, aurograb, bapineuzumab, basiliximab, belimumab, bevacizumab, briakinumab, canakinumab, catumaxomab, certolizumab pegol, cetuximab, daclizumab, denosumab, efalizumab, galiximab, gemtuzumab ozogamicin, golimumab, ibritumomab tiuxetan, infliximab, ipilimumab, lumiliximab, mepolizumab, motavizumab, muromonab, mycograb, natalizumab, nimotuzumab, ocrelizumab, ofatumumab, omalizumab, palivizumab, panitumumab, pertuzumab, ranibizumab, reslizumab, rituximab, teplizumab, tocilizumab/atlizumab, tositumomab, trastuzumab, Proxinium™, Rencarex™, ustekinumab, and zalutumumab.

In one embodiment, the immunoglobulin heavy chain and the immunoglobulin light chain of each heterodimer are derived or engineered from an antibody that binds a molecule including, but not limited to, the following list of proteins, as well as subunits, domains, motifs and epitopes belonging to the following list of proteins: renin; a growth hormone, including human growth hormone and bovine growth hormone; growth hormone releasing factor;

-   -   parathyroid hormone; thyroid stimulating hormone; lipoproteins;         alpha-1-antitrypsin; insulin A-chain; insulin B-chain;         proinsulin; follicle stimulating hormone; calcitonin;         luteinizing hormone; glucagon; clotting factors such as factor         VII, factor VIIIC, factor IX, tissue factor (TF), and von         Willebrands factor; anti-clotting factors such as Protein C;         atrial natriuretic factor; lung surfactant; a plasminogen         activator, such as urokinase or human urine or tissue-type         plasminogen activator (t-PA); bombesin; thrombin; hemopoietic         growth factor; tumor necrosis factor-alpha and -beta;         enkephalinase; RANTES (regulated on activation normally T-cell         expressed and secreted); human macrophage inflammatory protein         (MIP-1-alpha); a serum albumin such as human serum albumin;         Muellerian-inhibiting substance; relaxin A-chain; relaxin         B-chain; prorelaxin; mouse gonadotropin-associated peptide; a         microbial protein, such as beta-lactamase; DNase; IgE; a         cytotoxic T-lymphocyte associated antigen (CTLA), such as         CTLA-4; inhibin; activin; vascular endothelial growth factor         (VEGF); receptors for hormones or growth factors such as, for         example, EGFR, VEGFR; interferons such as alpha interferon         (alpha-IFN), beta interferon (beta-IFN) and gamma interferon         (gamma-IFN); protein A or D; rheumatoid factors; a neurotrophic         factor such as bone-derived neurotrophic factor (BDNF),         neurotrophin-3, -4, -5, or -6 (NT-3, NT-4, NT-5, or NT-6), or a         nerve growth factor; platelet-derived growth factor (PDGF);         fibroblast growth factor such as AFGF and PFGF; epidermal growth         factor (EGF); transforming growth factor (TGF) such as TGF-alpha         and TGF-beta, including TGF-1, TGF-2, TGF-3, TGF-4, or TGF-5;         insulin-like growth factor-I and -II (IGF-I and IGF-II); des         (1-3)-IGF-I (brain IGF-I), insulin-like growth factor binding         proteins; CD proteins such as CD2, CD3, CD4, CD8, CD11a, CD14,         CD18, CD19, CD20, CD22, CD23, CD25, CD33, CD34, CD40, CD40L,         CD52, CD63, CD64, CD80 and CD147; erythropoietin; osteoinductive         factors; immunotoxins; a bone morphogenetic protein (BMP); an         interferon such as interferon-alpha, -beta, and -gamma; colony         stimulating factors (CSFs), such as M-CSF, GM-CSF, and G-CSF;         interleukins (ILs), e.g., IL-1 to IL-13; TNF-alpha, superoxide         dismutase; T-cell receptors; surface membrane proteins; decay         accelerating factor; viral antigen such as, for example, a         portion of the AIDS envelope, e.g., gp120; transport proteins;         homing receptors; addressins; regulatory proteins; cell adhesion         molecules such as LFA-1, Mac 1, p150.95, VLA-4, ICAM-1, ICAM-3         and VCAM, a4/p7 integrin, and (Xv/p3 integrin including either a         or subunits thereof, integrin alpha subunits such as CD49a,         CD49b, CD49c, CD49d, CD49e, CD49f, alpha7, alpha8, alpha9,         alphaD, CD11a, CD11b, CD51, CD11c, CD41, alphaIIb, alphaIELb;         integrin beta subunits such as, CD29, CD 18, CD61, CD104, beta5,         beta6, beta7 and beta8; Integrin subunit combinations including         but not limited to, alphaVbeta3, alphaVbeta5 and alpha4beta7; a         member of an apoptosis pathway; IgE; blood group antigens;         flk2/flt3 receptor; obesity (OB) receptor; mpl receptor; CTLA-4;         protein C; an Eph receptor such as EphA2, EphA4, EphB2, etc.; a         Human Leukocyte Antigen (HLA) such as HLA-DR; complement         proteins such as complement receptor CR1, C1Rq and other         complement factors such as C3, and C5; a glycoprotein receptor         such as GpIb.alpha., GPIIb/IIIa and CD200; and fragments of any         of the above-listed polypeptides.

In an embodiment, the immunoglobulin heavy and light chains of each heterodimer are derived or engineered from antibodies that specifically bind cancer antigens including, but not limited to, ALK receptor (pleiotrophin receptor), pleiotrophin, KS 1/4 pan-carcinoma antigen; ovarian carcinoma antigen (CA125); prostatic acid phosphate; prostate specific antigen (PSA); melanoma-associated antigen p97; melanoma antigen gp75; high molecular weight melanoma antigen (HMW-MAA); prostate specific membrane antigen; carcinoembryonic antigen (CEA); polymorphic epithelial mucin antigen; human milk fat globule antigen; colorectal tumor-associated antigens such as: CEA, TAG-72, CO17-1A, GICA 19-9, CTA-1 and LEA; Burkitt's lymphoma antigen-38.13; CD19; human B-lymphoma antigen-CD20; CD33; melanoma specific antigens such as ganglioside GD2, ganglioside GD3, ganglioside GM2 and ganglioside GM3; tumor-specific transplantation type cell-surface antigen (TSTA); virally-induced tumor antigens including T-antigen, DNA tumor viruses and Envelope antigens of RNA tumor viruses; oncofetal antigen-alpha-fetoprotein such as CEA of colon, 514 oncofetal trophoblast glycoprotein and bladder tumor oncofetal antigen; differentiation antigen such as human lung carcinoma antigens L6 and L20; antigens of fibrosarcoma; human leukemia T cell antigen-Gp37; neoglycoprotein; sphingolipids; breast cancer antigens such as EGFR (Epidermal growth factor receptor); NY-BR-16; NY-BR-16 and HER2 antigen (p185HER2); polymorphic epithelial mucin (PEM); malignant human lymphocyte antigen-APO-1; differentiation antigen such as I antigen found in fetal erythrocytes; primary endoderm I antigen found in adult erythrocytes; preimplantation embryos; I(Ma) found in gastric adenocarcinomas; M18, M39 found in breast epithelium; SSEA-1 found in myeloid cells; VEP8; VEP9; Myl; Va4-D5; D156-22 found in colorectal cancer; TRA-1-85 (blood group H); SCP-1 found in testis and ovarian cancer; C14 found in colonic adenocarcinoma; F3 found in lung adenocarcinoma; AH6 found in gastric cancer; Y hapten; Ley found in embryonal carcinoma cells; TL5 (blood group A); EGF receptor found in A431 cells; E1 series (blood group B) found in pancreatic cancer; FC10.2 found in embryonal carcinoma cells; gastric adenocarcinoma antigen; CO-514 (blood group Lea) found in Adenocarcinoma; NS-10 found in adenocarcinomas; CO-43 (blood group Leb); G49 found in EGF receptor of A431 cells; MH2 (blood group ALeb/Ley) found in colonic adenocarcinoma; 19.9 found in colon cancer; gastric cancer mucins; T5A7 found in myeloid cells; R24 found in melanoma; 4.2, GD3, D1.1, OFA-1, GM2, OFA-2, GD2, and M1:22:25:8 found in embryonal carcinoma cells and SSEA-3 and SSEA-4 found in 4 to 8-cell stage embryos; Cutaneous Tcell Lymphoma antigen; MART-1 antigen; Sialy Tn (STn) antigen; Colon cancer antigen NY-CO-45; Lung cancer antigen NY-LU-12 valiant A; Adenocarcinoma antigen ART 1; Paraneoplastic associated brain-testis-cancer antigen (onconeuronal antigen MA2; paraneoplastic neuronal antigen); Neuro-oncological ventral antigen 2 (NOVA2); Hepatocellular carcinoma antigen gene 520; TUMOR-ASSOCIATED ANTIGEN CO-029; Tumor-associated antigens MAGE-C1 (cancer/testis antigen CT7), MAGE-B1 (MAGE-XP antigen), MAGE-B2 (DAM6), MAGE-2, MAGE-4-a, MAGE-4-b and MAGE-X2; Cancer-Testis Antigen (NY-EOS-1) and fragments of any of the above-listed polypeptides.

Human antibodies can be grouped into isotypes including IgG, IgA, IgE, IgM, and IgD. In one embodiment, an Fc is derived from an IgG isotype. In another embodiment, an Fc is derived from an IgA isotype. In another embodiment, an Fc is derived from an IgE isotype. In another embodiment, an Fc is derived from an IgM isotype. In another embodiment, an Fc is derived from an IgD isotype.

Human IgG antibodies can also be divided into the subclasses IgG1, IgG2, IgG3, and IgG4. Thus, in some embodiments, it is contemplated an Fc can be derived from an IgG1, IgG2, IgG3, or IgG4 subclass of antibodies.

Each heterodimer of the heterodimer pair can bind specifically to an epitope or antigen. In one embodiment, each heterodimer of the heterodimer pair binds to the same epitope. In another embodiment, the first heterodimer of the heterodimer pair specifically binds to an epitope on one antigen and the second heterodimer of the heterodimer pair binds specifically to a different epitope on the same antigen. In another embodiment, the first heterodimer of the heterodimer pair specifically binds to an epitope on a first antigen, and the second heterodimer of the heterodimer pair specifically binds to an epitope on a second antigen that is different from the first antigen. For example, in one embodiment, the first heterodimer binds specifically to Tissue Factor, while the second heterodimer binds specifically to antigen Her2(ErbB2). In another embodiment, the first heterodimer binds specifically to a molecule or cancer antigen described above. In another embodiment, the second heterodimer binds specifically to a molecule or cancer antigen described above. In yet another embodiment, the first heterodimer binds specifically to antigen CD3, while the second heterodimer binds specifically to antigen CD19.

As indicated above, in some embodiments, the immunoglobulin heavy chain and the immunoglobulin light chain of each heterodimer can be derived or engineered from a known therapeutic antibody, or from an antibody that binds various target molecules or cancer antigens. The amino acid and nucleotide sequences of numerous such molecules are readily available (see for example, GenBank: AJ308087.1 (Humanized anti-human tissue factor antibody D3H44 light chain variable region and CL domain); GenBank: AJ308086.1 (humanized anti-human tissue factor antibody D3H44 heavy chain variable region and CH1 domain); GenBank: HC359025.1 (Pertuzumab Fab light chain gene module); GenBank: HC359024.1 (Pertuzumab Fab heavy chain gene module); GenBank: GM685465.1 (Antibody Trastuzumab (=Herceptin)—wildtype; light chain); GenBank: GM685463.1 (Antibody Trastuzumab (=Herceptin)—wildtype; heavy chain); GenBank: GM685466.1 (Antibody Trastuzumab (=Herceptin)—GC-optimized light chain); and GenBank: GM685464.1 (Antibody Trastuzumab (=Herceptin)—GC-optimized heavy chain. The sequences of each of the GenBank numbers described herein are available from the NCBI website as of Nov. 28, 2012 and are each incorporated by reference in its entirety for all purposes.

In some aspects, an isolated antigen-binding construct comprises an amino acids sequence that is at least 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identical to an amino acid sequence or fragment thereof set forth in the Tables or accession numbers disclosed herein. In some aspects, an isolated antigen-binding construct comprises an amino acids sequence encoded by a polynucleotide that is at least 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identical to a nucleotide sequence or fragment thereof set forth in Tables or accession numbers disclosed herein.

Amino Acid Modifications to Immunoglobulin Heavy and Light Chains

At least one of the heterodimers of a heterodimer pair can comprise one or more amino acid modifications to their immunoglobulin heavy and/or immunoglobulin light chains such that the heavy chain of the first heterodimer preferentially pairs with one of the light chains rather than the other. Likewise, the heavy chain of the second heterodimer can preferentially pair with the second light chain rather than the first. This preferential pairing of one heavy chain with one of two light chains can be based on design sets comprising one immunoglobulin heavy chain and two immunoglobulin light chains where the immunoglobulin heavy chain preferentially pairs with one of the two immunoglobulin light chains over the other when the immunoglobulin heavy chain is co-expressed with both immunoglobulin light chains. Thus, a LCCA design set can comprise one immunoglobulin heavy chain, a first immunoglobulin light chain and a second immunoglobulin light chain.

In one embodiment, the one or more amino acid modifications comprise one or more amino acid substitutions.

In one embodiment, the preferential pairing demonstrated in the LCCA design set is established by modifying one or more amino acids that are part of the interface between the light chain and heavy chain. In one embodiment, the preferential pairing demonstrated in the LCCA design set is established by modifying one or more amino acids in at least one of the CH1 domain of the immunoglobulin heavy chain, the CL domain of a first immunoglobulin light chain and the CL domain of the second immunoglobulin light chain.

In one embodiment the one or amino acid modifications are limited to the conserved framework residues of the variable (VH, VL) and constant (CH1, CL) domains as indicated by the Kabat numbering of residues. For example, Almagro [Frontiers In Bioscience (2008) 13: 1619-1633] provides a definition of the framework residues on the basis of Kabat, Chotia, and IMGT numbering schemes.

In one embodiment, at least one of the heterodimers comprises one or more mutations introduced in the immunoglobulin heavy and immunoglobulin light chains that are complementary to each other. Complementarity at the heavy and light chain interface can be achieved on the basis of steric and hydrophobic contacts, electrostatic/charge interactions or a combination of the variety of interactions. The complementarity between protein surfaces is broadly described in the literature in terms of lock and key fit, knob into hole, protrusion and cavity, donor and acceptor etc., all implying the nature of structural and chemical match between the two interacting surfaces. In one embodiment, at least one of the heterodimers comprises one or more mutations where the mutations introduced in the immunoglobulin heavy and immunoglobulin light chains introduce a new hydrogen bond across the light and heavy chain at the interface. In one embodiment, at least one of the heterodimers comprises one or more mutations where the mutations introduced in the immunoglobulin heavy and immunoglobulin light chains introduce a new salt bridge across the light and heavy chain at the interface.

Non-limiting examples of suitable LCCA design sets are shown in Table 1, showing amino acid modifications in one immunoglobulin heavy chain CH1 domain (H1) and the two immunoglobulin light chain CL domains (L1 and L2) of the heterodimers, where H1 preferentially pairs with L1 when H1, L1 and L2 are co-expressed. The amino acid modifications shown in these LCCA design sets are based on the amino acid sequence of anti-tissue factor antibody D3H44 immunoglobulin heavy and light chains.

TABLE 1 Selected LCCA design sets with constant domain modifications to one immunoglobulin heavy chain (H1) and two immunoglobulin light chains, L1 and L2, where H1 preferentially pairs with L1 Set # H1_mutation* L1_mutation L2_mutation C500 WT# WT F116A C503 WT WT F98L C505 A139W_V190S F116S F118W_V133S C507 A139W_V190S F116A F118W_V133S C509 A139W WT F116A C510 A139V_V190S F116A F118W_V133S C511 A139V_V190S WT F118W_V133S C513 A139I_V190S F116A F118W_V133S C514 A139I_V190S WT F118W_V133S C515 A139G_V190A L135W_N137A F116A_L135A C517 A139G_V190A L135W F116A_L135A C519 A139G_V190A L135W F116A_L135V C521 S188I WT S176V_T178L C523 V190G F116A F118W_V133S C524 V190G F116S F118W_V133S C525 S188L_V190Y V133S S176L C527 F174V_P175S_S188G S176L WT C530 D146G_Q179R Q124E_Q160E_T178D Q160K_T178R C532 L143A_D144G_Q179R Q124E_V133W_Q160E_T180E V133A_Q160K_T178R *Kabat numbering; #WT refers to a wild-type immunoglobulin chain without amino acid mutations

Additional non-limiting examples of suitable LCCA design sets are shown in Table 2, showing amino acid modifications in one immunoglobulin heavy chain CH1 domain (H2) and the two immunoglobulin light chain CL domains (L1 and L2) of the heterodimers, where H2 preferentially pairs with L2 when H2, L1 and L2 are co-expressed:

TABLE 2 Selected LCCA design sets with constant domain modifications to one immunoglobulin heavy chain (H2) and two immunoglobulin light chains, L1 and L2, where H2 preferentially pairs with L2 Set # H2_mutation* L1_mutation L2_mutation C501 A139W_V190S WT# F116A C502 A139W_V190A WT F116A C504 F100W WT F98L C506 A139W F116S F118W_V133S C508 A139V F116A F118W_V133S C512 A139I WT F118W_V133S C516 A139W L135W_N137A F116A_L135A C518 A139W L135W F116A_L135A C520 A139W L135W F116A_L135V C522 WT WT S176V_T178L C526 F174V_P175S_S188G V133S S176L C528 F174V_S188L S176L WT C529 S188L S176L WT C531 K145T_Q179D_S188L Q124E_Q160E_T178D Q160K_T178R C533 K145T_Q179D_S188F Q124E_V133W_Q160E_T180E V133A_Q160K_T178R *Kabat numbering; #WT refers to a wild-type immunoglobulin chain without amino acid mutations

Additional non-limiting examples of suitable LCCA design sets are described in the Examples, Tables, and Figs.

In one embodiment, the LCCA design sets comprises an immunoglobulin heavy chain with at least one amino acid modification in the CH1 domain, a first immunoglobulin light chain with at least one amino acid modification in the CL domain, and a second immunoglobulin light chain without any amino acid modifications in the CL domain. In another embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least one amino acid modification in the CH1 domain, a first immunoglobulin light chain with at least one amino acid modification in the CL domain, and a second immunoglobulin light chain with at least one amino acid modification in the CL domain. In another embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least one amino acid modification in the CH1 domain, a first immunoglobulin light chain with at least two amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the CL domain. In another embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least one amino acid modification in the CH1 domain, a first immunoglobulin light chain with at least two amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least one amino acid modification in the CL domain.

In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with no amino acid modifications in the CH1 domain, a first immunoglobulin light chain with no amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least one amino acid modification in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with no amino acid modifications in the CH1 domain, a first immunoglobulin light chain with no amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the CL domain.

In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the CH1 domain, a first immunoglobulin light chain with no amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least one amino acid modification in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the CH1 domain, a first immunoglobulin light chain with at least one amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least one amino acid modification in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the CH1 domain, a first immunoglobulin light chain with at least one amino acid modification in the CL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the CH1 domain, a first immunoglobulin light chain with at least two amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the CH1 domain, a first immunoglobulin light chain with at least three amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the CL domain.

In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least three amino acid modifications in the CH1 domain, a first immunoglobulin light chain with no amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least one amino acid modifications in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least three amino acid modifications in the CH1 domain, a first immunoglobulin light chain with at least one amino acid modification in the CL domain, and a second immunoglobulin light chain with at least one amino acid modification in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least three amino acid modifications in the CH1 domain, a first immunoglobulin light chain with at least three amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the CL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least three amino acid modifications in the CH1 domain, a first immunoglobulin light chain with at least four amino acid modifications in the CL domain, and a second immunoglobulin light chain with at least three amino acid modifications in the CL domain.

In one embodiment, the preferential pairing demonstrated in the LCCA design set is established by modifying one or more amino acids in at least one of the VH domain of the immunoglobulin heavy chain, the VL domain of a first immunoglobulin light chain and the VL domain of the second immunoglobulin light chain. Non-limiting examples of suitable LCCA design sets are shown in Table 3, showing amino acid modifications in one immunoglobulin heavy chain VH domain (H1) and the two immunoglobulin light chain VL domains (L1 and L2) of the heterodimers, where H1 preferentially pairs with L1 when H1, L1 and L2 are co-expressed:

TABLE 3 Selected LCCA design sets with variable domain modifications to one immunoglobulin heavy chain and two immunoglobulin light chains, where H1 preferentially pairs with L1 Set # H1_mutation* L1_mutation L2_mutation V001 V37W_W103H F98L F98W V004 V37W_W103H F98L P44W V005 V37A_W103H P44W F98L V006 V37W_W103F F98L F98W V007 V37W F98A F98W V009 V37W F98A WT# V011 V37I WT F98L V013 V37A_W103V P44W F98A V015 V37A_W103H P44W F98A V016 V37A_W103H P44W F98W V020 L45W Y87G P44W V022 WT F98W F98A V023 WT WT F98A V024 Q39R Q38E F98A V026 Q39R Q38E WT V028 Q39R Q38E Q38R V030 Q39R Q38D Q38R V032 Q39M Q38M Q38E V034 Q39K Q38N_T85E Q38N_T85K V037 Q39E Q38R F98A V039 Q39D Q38R Q38D V040 V37E L89R_F98T WT V042 V37E_F100D L89R_F98W WT V044 V37E_F100D L89R_F98W F98Y *Kabat numbering; #WT refers to a wild-type immunoglobulin chain without amino acid mutations

Additional non-limiting examples of suitable LCCA design sets are depicted in Table 4, showing amino acid modifications in one immunoglobulin heavy chain VH domain (H2) and the two immunoglobulin light chain VL domains (L1 and L2) of the heterodimers, where H2 preferentially pairs with L2 when H2, L1 and L2 are co-expressed:

TABLE 4 Selected LCCA design sets with variable domain modifications to one immunoglobulin heavy chain and two immunoglobulin light chains, where H2 preferentially pairs with L2 Set # H2_mutation* L1_mutation L2_mutation V002 V37I F98L F98W V003 WT# F98L F98W V005 V37A_W103H F98L P44W V007 V37W F98W F98A V008 V37I F98A F98W V009 V37W WT F98A V010 V37I F98A WT V012 F100W WT F98L V014 V37W P44W F98A V017 V37A_W103V P44W F98W V018 V37A_W103V P44W F98L V019 V37I_F100W P44W F98L V021 V37A_W103H Y87G P44W V025 V37W Q38E F98A V027 WT Q38E WT V029 Q39E Q38E Q38R V030 Q39R Q38R Q38D V031 Q39E Q38D Q38R V033 Q39R Q38M Q38E V035 Q39D Q38N_T85E Q38N_T85K V036 Q39E Q38N_T85E Q38N_T85K V038 V37W Q38R F98A V041 WT L89R_F98T WT V043 WT L89R_F98W WT V045 V37S_A97K L89R_F98W F98Y *Kabat numbering; #WT refers to a wild-type immunoglobulin chain without amino acid mutations

In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with no amino acid modifications in the VH domain, a first immunoglobulin light chain with no amino acid modifications in the VL domain, and a second immunoglobulin light chain with at least one amino acid modification in the VL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with no amino acid modifications in the VH domain, a first immunoglobulin light chain with no amino acid modifications in the VL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the VL domain.

In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least one amino acid modification in the VH domain, a first immunoglobulin light chain with no amino acid modifications in the VL domain, and a second immunoglobulin light chain with at least one amino acid modification in the VL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least one amino acid modification in the VH domain, a first immunoglobulin light chain with at least one amino acid modification in the VL domain, and a second immunoglobulin light chain with at least one amino acid modification in the VL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least one amino acid modification in the VH domain, a first immunoglobulin light chain with at least two amino acid modifications in the VL domain, and a second immunoglobulin light chain with at least two amino acid modifications in the VL domain.

In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the VH domain, a first immunoglobulin light chain with no amino acid modifications in the VL domain, and a second immunoglobulin light chain with at least one amino acid modification in the VL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the VH domain, a first immunoglobulin light chain with at least two amino acid modifications in the VL domain, and a second immunoglobulin light chain with at least one amino acid modification in the VL domain. In one embodiment, the LCCA design set comprises an immunoglobulin heavy chain with at least two amino acid modifications in the VH domain, a first immunoglobulin light chain with at least one amino acid modification in the VL domain, and a second immunoglobulin light chain with at least one amino acid modification in the VL domain.

In one embodiment, the LCCA design sets shown in Tables 1 to 4 are combined to provide a combination comprising two distinct immunoglobulin heavy chains (H1 and 112) and two distinct immunoglobulin light chains (L1 and L2), where H1 preferentially pairs with L1 and H2 preferentially pairs with L2 when H1, H2, L1, and L2 are co-expressed. In one embodiment, a LCCA design set from Table 1, comprising modifications to the CH1 domain of the heavy chain and/or the CL domain of the light chains is combined with a LCCA design set from Table 2, also comprising modifications to the CH1 domain of the heavy chain and/or the CL domain of the light chains.

Non-limiting examples of design sets derived from combinations of LCCA design sets are shown in Table 5:

TABLE 5 Design sets comprising constant domain modifications Set # Set # H1_mutation* L1_mutation H2_mutation L2_mutation C500 C501 WT# WT A139W_V190S F116A C500 C502 WT WT A139W_V190A F116A C503 C504 WT WT F100W F98L C505 C506 A139W_V190S F116S A139W F118W_V133S C507 C508 A139W_V190S F116A A139V F118W_V133S C509 C501 A139W WT A139W_V190S F116A C509 C502 A139W WT A139W_V190A F116A C510 C508 A139V_V190S F116A A139V F118W_V133S C511 C512 A139V_V190S WT A139I F118W_V133S C513 C508 A139I_V190S F116A A139V F118W_V133S C514 C512 A139I_V190S WT A139I F118W_V133S C515 C516 A139G_V190A L135W_N137A A139W F116A_L135A C517 C518 A139G_V190A L135W A139W F116A_L135A C519 C520 A139G_V190A L135W A139W F116A_L135V C521 C522 S188I WT WT S176V_T178L C523 C508 V190G F116A A139V F118W_V133S C524 C506 V190G F116S A139W F118W_V133S C525 C526 S188L_V190Y V133S F174V_P175S_S188G S176L C527 C528 F174V_P175S_S188G S176L F174V_S188L WT C527 C529 F174V_P175S_S188G S176L S188L WT C530 C531 D146G_Q179R Q124E_Q160E_T178D K145T_Q179D_S188L Q160K_T178R C532 C533 L143A_D144G_Q179R Q124E_V133W_Q160E_T180E K145T_Q179D_S188F V133A_Q160K_T178R *Kabat numbering; #WT refers to a wild-type immunoglobulin chain without amino acid mutations

In one embodiment, a LCCA design set from Table 3, comprising modifications to the V_(H) domain of the heavy chain and/or the V_(L) domain of the light chains is combined with a LCCA design set from Table 4, also comprising modifications to the V_(H) domain of the heavy chain and/or the V_(L) domain of the light chains. Non-limiting examples of design sets derived from such combinations of LCCA design sets are shown in Table 6:

TABLE 6 Design sets comprising variable domain modifications Set # Set # H1_mutation* L1_mutation H2_mutation L2_mutation V001 V002 V37W_W103H F98L V37I F98W V001 V003 V37W_W103H F98L WT# F98W V004 V005 V37W_W103H F98L V37A_W103H P44W V006 V002 V37W_W103F F98L V37I F98W V006 V003 V37W_W103F F98L WT F98W V007 V008 V37W F98A V37I F98W V009 V010 V37W F98A V37I WT V011 V012 V37I WT F100W F98L V013 V014 V37A_W103V P44W V37W F98A V015 V014 V37A_W103H P44W V37W F98A V016 V017 V37A_W103H P44W V37A_W103V F98W V005 V018 V37A_W103H P44W V37A_W103V F98L V005 V019 V37A_W103H P44W V37I_F100W F98L V020 V021 L45W Y87G V37A_W103H P44W V022 V007 WT F98W V37W F98A V023 V009 WT WT V37W F98A V024 V025 Q39R Q38E V37W F98A V026 V027 Q39R Q38E WT WT V028 V029 Q39R Q38E Q39E Q38R V030 V031 Q39R Q38D Q39E Q38R V032 V033 Q39M Q38M Q39R Q38E V034 V035 Q39K Q38N_T85E Q39D Q38N_T85K V034 V036 Q39K Q38N_T85E Q39E Q38N_T85K V037 V038 Q39E Q38R V37W F98A V039 V030 Q39D Q38R Q39R Q38D V040 V041 V37E L89R_F98T WT WT V042 V043 V37E_F100D L89R_F98W WT WT V044 V045 V37E_F100D L89R_F98W V37S_A97K F98Y *Kabat numbering; #WT refers to a wild-type immunoglobulin chain without amino acid mutations

Transferability of Specific Amino Acid Modifications Identified Herein to Other Antibodies:

Although the specific amino acid modifications to immunoglobulin heavy and light chains identified above have been described with respect to the D3H44 anti-tissue factor extracellular domain antibody immunoglobulin heavy and light chains, it is contemplated and demonstrated herein (see Examples, Figs, and Tables) that these amino acid modifications are transferable to other immunoglobulin heavy and light chains, resulting in similar patterns of preferential pairing of one immunoglobulin heavy chain with one of the two immunoglobulin light chains in view of the following.

The VH:VL and CH1:CL interface residues in the interface between immunoglobulin heavy and light chains are relatively well conserved (Padlan et al., 1986, Mol. Immunol. 23(9): 951-960). This sequence conservation, a result of evolutionary constraints, increases the likelihood that functionally active antibody binding domains will be formed during combinatorial pairing of light and heavy chains. As a result of this sequence conservation, it follows that sequence modifications in the specific examples noted above for D3H44, which drive preferential pairing, could transfer to other heavy and light chain pair heterodimers with approximately equivalent results being obtained with respect to preferential pairing, since this region displays high sequence conservation across antibodies; Further, when sequence differences do occur, these usually lie distal to the CH1:CL interface. This is particularly the case for the CH1 and CL domains. There is, however, some sequence variability at the antigen-binding site with respect to CDR (complementarity-determining regions) loop residues (and length), particularly for CDR-H3. Thus, in one embodiment, the heterodimer pairs according to the invention comprise heterodimers where at least one heterodimer comprises one or more amino acid modifications in the VH and/or VL domains that lie distal to the CDR loops when the amino acid sequence of the antigen-binding site is significantly different from that of the D3H44 antibody. In another embodiment, the heterodimer pairs according to the invention comprise heterodimers where at least one heterodimer comprises one or more amino acid modifications in the VH and/or VL domains that lie proximal or distal to the CDR loops, when the amino acid sequence of the antigen-binding site is substantially the same as that of the D3H44 antibody.

In one embodiment, the amino acid modifications described herein are transferable to the immunoglobulin heavy and light chains of antibodies based on human or humanized IgG1/K. Non-limiting examples of such IgG1/K chains include Ofatumumab (for human) or Trastuzumab, Pertuzumab or Bevacizumab (for humanized).

In another embodiment, the amino acid modifications described herein are transferable to the immunoglobulin heavy and light chains of antibodies utilizing commonly used VH and VL subgroups. Non-limiting examples of such antibodies include Peruzumab.

In one embodiment, the amino acid modifications described herein are transferable to the immunoglobulin heavy and light chains of antibodies having a framework close to germline. Examples of such antibodies include Obinutuzumab.

In one embodiment, the amino acid modifications described herein are transferable to the immunoglobulin heavy and light chains of antibodies having a VH:VL interdomain angle close to the average observed for heavy and light chain pairs. An example of this type of antibody includes, but is not limited to Pertuzumab. In another embodiment, the amino acid modifications described herein are transferable to the immunoglobulin heavy and light chains of antibodies having canonical CL and CH1 domains. Suitable examples of such antibodies include, but are not limited to Trastuzumab.

In some embodiments, certain subsets of the amino acid modifications described herein are utilized in variant domains in antigen binding constructs provided above.

The Examples, Figs, and Tables demonstrate that amino acid modifications (e.g., within one or more Fab fragments comprising a variable region and a constant region) are transferable to other immunoglobulin heavy and light chains, resulting in similar patterns of preferential pairing of one immunoglobulin heavy chain with one of the two immunoglobulin light chains.

Preferential Pairing

As described above, at least one heterodimer of the antigen binding construct/heterodimer pairs according to the invention can comprise one or more amino acid modifications to their immunoglobulin heavy and/or immunoglobulin light chains such that the heavy chain of the one heterodimer, for example H1, preferentially pairs with one of the light chains, for example L1, rather than the other light chain, L2, and the heavy chain of the other heterodimer, H2, preferentially pairs with the light chain, L2, rather than the light chain L1. In other words, the desired, preferential pairing is considered to be between H1 and L1, and between H2 and L2. Preferential pairing between, for example, H1 and L1 is considered to occur if the yield of the H1-L1 heterodimer is greater than the yield of the mispaired H1-L2 heterodimer when H1 is combined with L1 and L2, relative to the respective pairing of corresponding H1/L1 pair to H2/L2 pair without the one or more amino acid modifications. Likewise, preferential pairing between H2 and L2 is considered to occur if the yield of the H2-L2 heterodimer is greater than the yield of the mispaired H2-L1 heterodimer when H2 is combined with L1 and L2, relative to the respective pairing of corresponding H1-L1 pair to H2-L2 pair without the one or more amino acid modifications. In this context, an heterodimer comprising H1 and L1 (H1-L1), or H2 and L2 (H2-L2), is referred to herein as a preferentially paired, correctly paired, obligate pair, or desired heterodimer, while a heterodimer comprising H1 and L2 (H1-L2), or H2 and L1 (H2-L1), is referred to herein as a mispaired heterodimer. The set of mutations corresponding to the two heavy chains and the two light chains meant to achieve selective pairing of H1-L1 and H2-L2 is referred to as a design set.

Thus, in one embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the relative yield of the desired heterodimer is greater than 55%. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the relative yield of the desired heterodimer is greater than 60%. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the relative yield of the desired heterodimer is greater than 70%. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the relative yield of the desired heterodimer is greater than 80%. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the relative yield of the desired heterodimer is greater than 90%. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the relative yield of the desired heterodimer is greater than 95%.

In the above example, preferential pairing between H1-L1 is considered to occur if the amount of the desired H1-L1 heterodimer is greater than the amount of the mispaired H1-L2 heterodimer when H1 is co-expressed with L1 and L2. Similarly, preferential pairing between H2-L2 is considered to occur if the amount of the desired H2-L2 heterodimer is greater than the amount of the mispaired H2-L2 heterodimer when H2 is co-expressed with L1 and L2. Thus, in one embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 1.25:1. In one embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 1.5:1. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 2:1. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 3:1. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 5:1. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 10:1. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 25:1. In another embodiment, when one immunoglobulin heavy chain of a heterodimer is co-expressed with two immunoglobulin light chains, the ratio of the desired heterodimer to the mispaired heterodimer is greater than 50:1.

Thermal Stability of Heterodimers

In one embodiment, each heterodimer of the heterodimer pair according to the invention has a thermal stability that is comparable to that of a heterodimer comprising the same immunoglobulin heavy and light chains but without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In one embodiment, thermal stability is determined by measurement of melting temperature, or Tm. Thus, in one embodiment, the thermal stability of a heterodimer according to the invention is within about 10° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. Thus, in one embodiment, the thermal stability of a heterodimer according to the invention is within about 5° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, the thermal stability of a heterodimer according to the invention is within about 3° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, the thermal stability of a heterodimer according to the invention is within about 2° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, the thermal stability of a heterodimer according to the invention is within about 1.5° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, the thermal stability of a heterodimer according to the invention is within about 1° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, the thermal stability of a heterodimer according to the invention is within about 0.5° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, the thermal stability of a heterodimer according to the invention is within about 0.25° C. of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein.

Affinity of Heterodimers for Antigen

In one embodiment, each heterodimer of the heterodimer pair has an affinity for its respective antigen that is the same or comparable to that of a heterodimer comprising the same immunoglobulin heavy and light chains but without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In one embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about 50 fold, or one order of magnitude, of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In one embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about 25 fold, or one order of magnitude, of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In one embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about 10 fold, or one order of magnitude, of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about 5 fold of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about 2.5 fold of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about 2 fold of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about 1.5 fold of that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein. In another embodiment, a heterodimer of the heterodimer pair has an affinity for its respective antigen that is within about the same as that of a heterodimer comprising the same immunoglobulin heavy and light chains without the amino acid modifications to the CH1, CL, VH, or VL domains described herein.

Additional Optional Modifications

In one embodiment, the immunoglobulin heavy and light chains of the heterodimer pairs according to the invention may be further modified (i.e., by the covalent attachment of various types of molecules) such that covalent attachment does not interfere with the preferential pairing between heavy chain and light chains or affect the ability of the heterodimer to bind to its antigen, or affect its stability. Such modification include, for example, but not by way of limitation, glycosylation, acetylation, pegylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to a cellular ligand or other protein, etc. Any of numerous chemical modifications may be carried out by known techniques, including, but not limited to, specific chemical cleavage, acetylation, formylation, metabolic synthesis of tunicamycin, etc.

In another embodiment, the immunoglobulin heavy and light chains of the heterodimer pairs according to the invention may be conjugated (directly or indirectly) to a therapeutic agent or drug moiety that modifies a given biological response. Therapeutic agents or drug moieties are not to be construed as limited to classical chemical therapeutic agents. For example, the drug moiety may be a protein or polypeptide possessing a desired biological activity. Such proteins may include, for example, a toxin such as abrin, ricin A, Onconase (or another cytotoxic RNase), pseudomonas exotoxin, cholera toxin, or diphtheria toxin; a protein such as tumor necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet derived growth factor, tissue plasminogen activator, an apoptotic agent, e.g., TNF-alpha, TNF-beta, AIM I (see, International Publication No. WO 97/33899), AIM II (see, International Publication No. WO 97/34911), Fas Ligand (Takahashi et al., 1994, J. Immunol., 6:1567), and VEGI (see, International Publication No. WO 99/23105), a thrombotic agent or an anti-angiogenic agent, e.g., angiostatin or endostatin; or, a biological response modifier such as, for example, a lymphokine (e.g., interleukin-1 (“IL-1”), interleukin-2 (“IL-2”), interleukin-6 (“IL-6”), granulocyte macrophage colony stimulating factor (“GM-CSF”), and granulocyte colony stimulating factor (“G-CSF”)), or a growth factor (e.g., growth hormone (“GH”)).

Moreover, in an alternate embodiment, an antibody can be conjugated to therapeutic moieties such as a radioactive materials or macrocyclic chelators useful for conjugating radiometal ions (see above for examples of radioactive materials). In certain embodiments, the macrocyclic chelator is 1,4,7,10-tetraazacyclododecane-N,N′,N″,N″-tetraacetic acid (DOTA) which can be attached to the antibody via a linker molecule. Such linker molecules are commonly known in the art and described in Denardo et al., 1998, Clin Cancer Res. 4:2483; Peterson et al., 1999, Bioconjug. Chem. 10:553; and Zimmerman et al., 1999, Nucl. Med. Biol. 26:943.

In some embodiments, the immunoglobulin heavy and light chains of the heterodimer are expressed as fusion proteins comprising a tag to facilitate purification and/or testing etc. As referred to herein, a “tag” is any added series of amino acids which are provided in a protein at either the C-terminus, the N-terminus, or internally that contributes to the identification or purification of the protein. Suitable tags include but are not limited to tags known to those skilled in the art to be useful in purification and/or testing such as albumin binding domain (ABD), His tag, FLAG tag, glutathione-s-transferase, haemaglutinin (HA) and maltose binding protein. Such tagged proteins may also be engineered to comprise a cleavage site, such as a thrombin, enterokinase or factor X cleavage site, for ease of removal, of the tag before, during or after purification.

In some embodiments, one or more of the cysteine residues at the bottom of the Fab domain in the light (position 214, Kabat numbering) and heavy (position 233, Kabat numbering) chain that form an interchain disulphide bond can be modified to serine or alanine or a non-cysteine or a distinct amino acid.

It is contemplated that additional amino acid modifications can be made to the immunoglobulin heavy chains in order to increase the level of preferential pairing, and/or the thermal stability of the heterodimer pairs. For example, additional amino acid modifications can be made to the immunoglobulin heavy chain Fc domain in order to drive preferential pairing between heterodimer pairs relative to homodimer pairs. Such amino acid modifications are known in the art and include, for example, those described, in US Patent Publication No. 2012/0149876. Alternatively, alternate strategies for driving preferential pairing between heterodimer pairs relative to homodimer pairs such as, for example, “knobs into holes”, charged residues with ionic interactions, and strand-exchange engineered domain (SEED) technologies can also be employed. The latter strategies have been described in the art and are reviewed in Klein et al, supra. Further discussion of Fc domains follows below.

Fc Domains

The constructs described herein can further include an Fc. In some aspects, the Fc comprises at least one or two C_(H3) domain sequences. In some aspects, the Fc is coupled, with or without one or more linkers, to a first heterodimer and/or a second heterodimer. In some aspects, the Fc is a human Fc. In some aspects, the Fc is a human IgG or IgG1 Fc. In some aspects, the Fc is a heterodimeric Fc. In some aspects, the Fc comprises at least one or two C_(H2) domain sequences.

In some aspects, the Fc comprises one or more modifications in at least one of the C_(H3) domain sequences. In some aspects, the Fc comprises one or more modifications in at least one of the C_(H2) domain sequences. In some aspects, an Fc is a single polypeptide. In some aspects, an Fc is multiple peptides, e.g., two polypeptides.

In some aspects, the Fc comprises one or more modifications in at least one of the C_(H3) sequences. In some aspects, the Fc comprises one or more modifications in at least one of the C_(H2) sequences. In some aspects, an Fc is a single polypeptide. In some aspects, an Fc is multiple peptides, e.g., two polypeptides.

In some aspects, Fc is an Fc described in patent applications PCT/CA2011/001238, filed Nov. 4, 2011 or PCT/CA2012/050780, filed Nov. 2, 2012, the entire disclosure of each of which is hereby incorporated by reference in its entirety for all purposes.

In some aspects, a construct described herein comprises a heterodimeric Fc comprising a modified CH3 domain that has been asymmetrically modified. The heterodimeric Fc can comprise two heavy chain constant domain polypeptides: a first heavy chain polypeptide and a second heavy chain polypeptide, which can be used interchangeably provided that Fc comprises one first heavy chain polypeptide and one second heavy chain polypeptide. Generally, the first heavy chain polypeptide comprises a first CH3 sequence and the second heavy chain polypeptide comprises a second CH3 sequence.

Two CH3 sequences that comprise one or more amino acid modifications introduced in an asymmetric fashion generally results in a heterodimeric Fc, rather than a homodimer, when the two CH3 sequences dimerize. As used herein, “asymmetric amino acid modifications” refers to any modification where an amino acid at a specific position on a first CH3 sequence is different from the amino acid on a second CH3 sequence at the same position, and the first and second CH3 sequence preferentially pair to form a heterodimer, rather than a homodimer. This heterodimerization can be a result of modification of only one of the two amino acids at the same respective amino acid position on each sequence; or modification of both amino acids on each sequence at the same respective position on each of the first and second CH3 sequences. The first and second CH3 sequence of a heterodimeric Fc can comprise one or more than one asymmetric amino acid modification.

Table X provides the amino acid sequence of the human IgG1 Fc sequence, corresponding to amino acids 231 to 447 of the full-length human IgG1 heavy chain. The CH3 sequence comprises amino acid 341-447 of the full-length human IgG1 heavy chain.

Typically an Fc can include two contiguous heavy chain sequences (A and B) that are capable of dimerizing. In some aspects, one or both sequences of an Fc include one or more mutations or modifications at the following locations: L351, F405, Y407, T366, K392, T394, T350, S400, and/or N390, using EU numbering. In some aspects, an Fc includes a mutant sequence shown in Table X. In some aspects, an Fc includes the mutations of Variant 1 A-B. In some aspects, an Fc includes the mutations of Variant 2 A-B. In some aspects, an Fc includes the mutations of Variant 3 A-B. In some aspects, an Fc includes the mutations of Variant 4 A-B. In some aspects, an Fc includes the mutations of Variant 5 A-B.

TABLE X Human IgG1 Fc APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD sequence SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYR 231-447 VVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI (EU-numbering) SKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKG FYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSL SLSPGK (SEQ ID NO: 1) Variant IgG1 Fc sequence (231-447) Chain Mutations 1 A L351Y_F405A_Y407V 1 B T366L_K392M_T394W 2 A L351Y_F405A_Y407V 2 B T366L_K392L_T394W 3 A T350V_L351Y_F405A_Y407V 3 B T350V_T366L_K392L_T394W 4 A T350V_L351Y_F405A_Y407V 4 B T350V_T366L_K392M_T394W 5 A T350V_L351Y_S400E_F405A_Y407V 5 B T350V_T366L_N390R_K392M_T394W

The first and second CH3 sequences can comprise amino acid mutations as described herein, with reference to amino acids 231 to 447 of the full-length human IgG1 heavy chain. In one embodiment, the heterodimeric Fc comprises a modified CH3 domain with a first CH3 sequence having amino acid modifications at positions F405 and Y407, and a second CH3 sequence having amino acid modifications at position T394. In one embodiment, the heterodimeric Fc comprises a modified CH3 domain with a first CH3 sequence having one or more amino acid modifications selected from L351Y, F405A, and Y407V, and the second CH3 sequence having one or more amino acid modifications selected from T366L, T366I, K392L, K392M, and T394W.

In one embodiment, a heterodimeric Fc comprises a modified CH3 domain with a first CH3 sequence having amino acid modifications at positions L351, F405 and Y407, and a second CH3 sequence having amino acid modifications at positions T366, K392, and T394, and one of the first or second CH3 sequences further comprising amino acid modifications at position Q347, and the other CH3 sequence further comprising amino acid modification at position K360. In another embodiment, a heterodimeric Fc comprises a modified CH3 domain with a first CH3 sequence having amino acid modifications at positions L351, F405 and Y407, and a second CH3 sequence having amino acid modifications at position T366, K392, and T394, one of the first or second CH3 sequences further comprising amino acid modifications at position Q347, and the other CH3 sequence further comprising amino acid modification at position K360, and one or both of said CH3 sequences further comprise the amino acid modification T350V.

In one embodiment, a heterodimeric Fc comprises a modified CH3 domain with a first CH3 sequence having amino acid modifications at positions L351, F405 and Y407, and a second CH3 sequence having amino acid modifications at positions T366, K392, and T394 and one of said first and second CH3 sequences further comprising amino acid modification of D399R or D399K and the other CH3 sequence comprising one or more of T411E, T411D, K409E, K409D, K392E and K392D. In another embodiment, a heterodimeric Fc comprises a modified CH3 domain with a first CH3 sequence having amino acid modifications at positions L351, F405 and Y407, and a second CH3 sequence having amino acid modifications at positions T366, K392, and T394, one of said first and second CH3 sequences further comprises amino acid modification of D399R or D399K and the other CH3 sequence comprising one or more of T411E, T411D, K409E, K409D, K392E and K392D, and one or both of said CH3 sequences further comprise the amino acid modification T350V.

In one embodiment, a heterodimeric Fc comprises a modified CH3 domain with a first CH3 sequence having amino acid modifications at positions L351, F405 and Y407, and a second CH3 sequence having amino acid modifications at positions T366, K392, and T394, wherein one or both of said CH3 sequences further comprise the amino acid modification of T350V.

In one embodiment, a heterodimeric Fc comprises a modified CH3 domain comprising the following amino acid modifications, where “A” represents the amino acid modifications to the first CH3 sequence, and “B” represents the amino acid modifications to the second CH3 sequence: A:L351Y_F405A_Y407V, B:T366L_K392M_T394W, A:L351Y_F405A_Y407V, B:T366L_K392L_T394W, A:T350V_L351Y_F405A_Y407V, B:T350V_T366L_K392L_T394W, A:T350V_L351Y_S400E_F405A_Y407V, B:T350V_T366L_K392M_T394W, A:T350V_L351Y_S400E_F405A_Y407V, and/or B:T350V_T366L_N390R_K392M_T394W.

The one or more asymmetric amino acid modifications can promote the formation of a heterodimeric Fc in which the heterodimeric CH3 domain has a stability that is comparable to a wild-type homodimeric CH3 domain. In an embodiment, the one or more asymmetric amino acid modifications promote the formation of a heterodimeric Fc domain in which the heterodimeric Fc domain has a stability that is comparable to a wild-type homodimeric Fc domain. In an embodiment, the one or more asymmetric amino acid modifications promote the formation of a heterodimeric Fc domain in which the heterodimeric Fc domain has a stability observed via the melting temperature (Tm) in a differential scanning calorimetry study, and where the melting temperature is within 4° C. of that observed for the corresponding symmetric wild-type homodimeric Fc domain. In some aspects, the Fc comprises one or more modifications in at least one of the CH3 sequences that promote the formation of a heterodimeric Fc with stability comparable to a wild-type homodimeric Fc.

In one embodiment, the stability of the CH3 domain can be assessed by measuring the melting temperature of the CH3 domain, for example by differential scanning calorimetry (DSC). Thus, in a further embodiment, the CH3 domain has a melting temperature of about 68° C. or higher. In another embodiment, the CH3 domain has a melting temperature of about 70° C. or higher. In another embodiment, the CH3 domain has a melting temperature of about 72° C. or higher. In another embodiment, the CH3 domain has a melting temperature of about 73° C. or higher. In another embodiment, the CH3 domain has a melting temperature of about 75° C. or higher. In another embodiment, the CH3 domain has a melting temperature of about 78° C. or higher. In some aspects, the dimerized C_(H3) sequences have a melting temperature (Tm) of about 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 77.5, 78, 79, 80, 81, 82, 83, 84, or 85° C. or higher.

In some embodiments, a heterodimeric Fc comprising modified CH3 sequences can be formed with a purity of at least about 75% as compared to homodimeric Fc in the expressed product. In another embodiment, the heterodimeric Fc is formed with a purity greater than about 80%. In another embodiment, the heterodimeric Fc is formed with a purity greater than about 85%. In another embodiment, the heterodimeric Fc is formed with a purity greater than about 90%. In another embodiment, the heterodimeric Fc is formed with a purity greater than about 95%. In another embodiment, the heterodimeric Fc is formed with a purity greater than about 97%. In some aspects, the Fc is a heterodimer formed with a purity greater than about 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% when expressed. In some aspects, the Fc is a heterodimer formed with a purity greater than about 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% when expressed via a single cell.

Additional methods for modifying monomeric Fc polypeptides to promote heterodimeric Fc formation are described in International Patent Publication No. WO 96/027011 (knobs into holes), in Gunasekaran et al. (Gunasekaran K. et al. (2010) J Biol Chem. 285, 19637-46, electrostatic design to achieve selective heterodimerization), in Davis et al. (Davis, J H. et al. (2010) Prot Eng Des Sel; 23(4): 195-202, strand exchange engineered domain (SEED) technology), and in Labrijn et al [Efficient generation of stable bispecific IgG1 by controlled Fab-arm exchange. Labrijn A F, Meesters J I, de Goeij B E, van den Bremer E T, Neijssen J, van Kampen M D, Strumane K, Verploegen S, Kundu A, Gramer M J, van Berkel P H, van de Winkel J G, Schuurman J, Parren P W. Proc Natl Acad Sci USA. 2013 Mar. 26; 110(13):5145-50.

In some embodiments an isolated construct described herein comprises an antigen binding construct which binds an antigen; and a dimeric Fc polypeptide construct that has superior biophysical properties like stability and ease of manufacture relative to an antigen binding construct which does not include the same Fc polypeptide. A number of mutations in the heavy chain sequence of the Fc are known in the art for selectively altering the affinity of the antibody Fc for the different Fcgamma receptors. In some aspects, the Fc comprises one or more modifications to promote selective binding of Fc-gamma receptors.

The CH2 domain is amino acid 231-340 of the sequence shown in Table X. Exemplary mutations are listed below:

S298A/E333A/K334A, S298A/E333A/K334A/K326A (Lu Y, Vernes J M, Chiang N, et al. J Immunol Methods. 2011 Feb. 28; 365(1-2):132-41);

F243 L/R292P/Y300L/V305I/P396L, F243 L/R292P/Y300L/L235V/P396L (Stavenhagen Y J B, Goriatov S, Tuaillon N, et al. Cancer Res. 2007 Sep. 15; 67(18):8882-90; Nordstrom J L, Gorlatov S, Zhang W, et al, Breast Cancer Res. 2011 Nov. 30; 13(6):R123);

F243L (Stewart R, Thorn G, Levens M, et al. Protein Eng Des Sel. 2011 September; 24(9):671-8.), S298A/E333A/K334A (Shields R L, Namenuk A K, Hong K, et al. J Biol Chem. 2001 Mar. 2; 276(9):6591-604);

S239D/I332E/A330L, S239D/I332E (Lazar G A, Dang W, Karki S, et al. Proc Natl Acad Sci USA. 2006 Mar. 14; 103(l 1):4005-10);

S239D/S267E, S267E/L328F (Chu S Y, Vostiar I, Karki S, et al. Mol Immunol. 2008 September; 45(15):3926-33);

S239D/D265S/S298A1332E, S239E/S298A/K326A/A327H, G237F/S298A/A330L/I332E, S239D/1332E/S298A, S239D/K326E/A330L/I332E/S298A, G236A/S239D/D270L/I332E, S239E/S267E/H268D, L234F/S267E/N325L, G237F/V266L/S267D and other mutations listed in WO2011/120134 and WO2011/120135, herein incorporated by reference. Therapeutic Antibody Engineering (by William R. Strohl and Lila M. Strohl, Woodhead Publishing series in Biomedicine No 11, ISBN 1 907568 37 9, October 2012) lists mutations on page 283.

In some embodiments a CH2 domain comprises one or more asymmetric amino acid modifications. In some embodiments a CH2 domain comprises one or more asymmetric amino acid modifications to promote selective binding of a FcγR. In some embodiments the CH2 domain allows for separation and purification of an isolated construct described herein.

FcRn Binding and PK Parameters

As is known in the art, binding to FcRn recycles endocytosed antibody from the endosome back to the bloodstream (Raghavan et al., 1996, Annu Rev Cell Dev Biol 12:181-220; Ghetie et al., 2000, Annu Rev Immunol 18:739-766). This process, coupled with preclusion of kidney filtration due to the large size of the full-length molecule, results in favorable antibody serum half-lives ranging from one to three weeks. Binding of Fc to FcRn also plays a key role in antibody transport. Thus, in one embodiment, the constructs of the invention are able to bind FcRn.

Additional Modifications to Improve Effector Function.

In some embodiments a construct described herein can be modified to improve its effector function. Such modifications are known in the art and include afucosylation, or engineering of the affinity of the Fc portion of antibodies towards an activating receptor, mainly FCGR3a for ADCC, and towards C1q for CDC. The following Table Y summarizes various designs reported in the literature for effector function engineering.

TABLE Y Reference Mutations Effect Lu, 2011, Ferrara Afucosylated Increased 2011, Mizushima 2011 ADCC Lu, 2011 S298A/E333A/K334A Increased ADCC Lu, 2011 S298A/E333A/K334A/K326A Increased ADCC Stavenhagen, 2007 F243L/R292P/Y300L/V305I/P396L Increased ADCC Nordstrom, 2011 F243L/R292P/Y300L/L235V/P396L Increased ADCC Stewart, 2011 F243L Increased ADCC Shields, 2001 S298A/E333A/K334A Increased ADCC Lazar, 2006 S239D/I332E/A330L Increased ADCC Lazar, 2006 S239D/I332E Increased ADCC Bowles, 2006 AME-D, not specified Increased mutations ADCC Heider, 2011 37.1, mutations not Increased disclosed ADCC Moore, 2010 S267E/H268F/S324T Increased CDC

Thus, in one embodiment, a construct described herein can include a dimeric Fc that comprises one or more amino acid modifications as noted in the above table that confer improved effector function. In another embodiment, the construct can be afucosylated to improve effector function.

Linkers

The constructs described herein can include one or more heterodimers described herein operatively coupled to an Fc described herein. In some aspects, Fc is coupled to the one or more heterodimers with or without one or more linkers. In some aspects, Fc is directly coupled to the one or more heterodimers. In some aspects, Fc is coupled to the one or more heterodimers by one or more linkers. In some aspects, Fc is coupled to the heavy chain of each heterodimer by a linker.

In some aspects, the one or more linkers are one or more polypeptide linkers. In some aspects, the one or more linkers comprise one or more antibody hinge regions. In some aspects, the one or more linkers comprise one or more IgG1 hinge regions.

Methods of Preparing Heterodimer Pairs

As described above, the heterodimer pairs according to the invention can comprise a first heterodimer and a second heterodimer, each heterodimer comprising an immunoglobulin heavy chain or fragment thereof having at least a VH and CH1 domain, and an immunoglobulin light chain having a VL domain and a CL domain. The immunoglobulin heavy chains and immunoglobulin light chains of the heterodimer can readily be prepared using recombinant DNA technology known in the art. Standard techniques such as, for example, those described in Sambrook and Russell, Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 3rd ed., 2001); Sambrook et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2nd ed., 1989); Short Protocols in Molecular Biology (Ausubel et al., John Wiley and Sons, New York, 4th ed., 1999); and Glick and Pasternak, Molecular Biotechnology: Principles and Applications of Recombinant DNA (ASM Press, Washington, D.C., 2nd ed., 1998) can be used for recombinant nucleic acid methods, nucleic acid synthesis, cell culture, transgene incorporation, and recombinant protein expression. Alternatively, the heterodimers and heterodimer pairs according to the invention can be chemically synthesized.

The nucleic acid and amino acid sequences of the immunoglobulin heavy and light chains of the antibodies from which the heterodimers are derived are either known in the art or can be readily determined using nucleic acid and/or protein sequencing methods. Methods of genetically fusing the tags described herein to the immunoglobulin heavy and/or light chains are known in the art, and some are described below and in the Examples.

For example, methods of expressing and co-expressing immunoglobulin heavy and light chains in a host cell are well known in the art. In addition, methods of tagging heavy chains and/or light chains using recombinant DNA technology are also well known in the art. Expression vectors and host cells suitable for expression of the heavy and light chains are also well known in the art as described below.

Bispecific antibody production methods that do not rely on the use only a single clonal or transient cell line expressing all four chains are known in the art (Gramer, et al. (2013) mAbs 5, 962; Strop et al. (2012) J Mol Biol 420, 204.). These methods rely on a post production arm exchange under redox conditions of the two pairs of light and heavy chain involved in the formation of bispecific antibody (Redox production). In this approach the H1:L1 and H2:L2 pairs can be expressed in two different cell lines to independently produce the two Fab arms. Subsequently, the two Fab arms are mixed under select redox conditions to achieve re-association of the two unique heavy chain H1 and H2 to form the bispecific antibody comprising L1:H1:H2:L2 chains. One can envision the use of the library/dataset of designs described herein in the production of bispecific antibodies using the Redox production method or modified versions of that method.

In certain embodiments, cell-free protein expression systems are utilized to co-express polypeptides (e.g., heavy and light chain polypeptides) without the use of living cells. Instead, all components needed to transcribe DNA to RNA and translate the RNA to protein (e.g. ribosomes, tRNAs, enzymes, cofactors, amino acids) are provided in solution for use in vitro. In certain embodiments, the in vitro expression requires (1) the genetic template (mRNA or DNA) encoding the heavy and light chain polypeptides and (2) a reaction solution containing the necessary transcriptional and translational molecular machinery. In certain embodiments, cell extracts substantially supply components of the reaction solution, for instance: RNA polymerases for mRNA transcription, ribosomes for polypeptide translation, tRNA, amino acids, enzymatic cofactors, an energy source, and cellular components essential for proper protein folding. Cell-free protein expression systems can be prepared using lysates derived from bacterial cells, yeast cells, insect cells, plant cells, mammalian cells, human cells or combinations thereof. Such cell lysates can provide the correct composition and proportion of enzymes and building blocks required for translation. In some embodiments, cell membranes are removed to leave only the cytosolic and organelle components of the cell.

Several cell-free protein expression systems are known in the art as reviewed in Carlson et al. (2012) Biotechnol. Adv. 30:1185-1194. For example, cell-free protein expression systems are available based on prokaryotic or eukaryotic cells. Examples of prokaryotic cell-free expression systems include those from E. coli. Eukaryotic cell-free protein expression systems are available based on extracts from rabbit reticulocytes, wheat germ, and insect cells, for example. Such prokaryotic and eukaryotic cell-free protein expression systems are commercially available from companies such as Roche, Invitrogen, Qiagen, and Novagen. One skilled in the art would readily be able to select suitable cell-free protein expression systems that would produce polypeptides (e.g., heavy chain and light chain polypeptides) that are capable of pairing with each other. Further, the cell-free protein expression system can also be supplemented with chaperones (e.g. BiP) and isomerases (e.g. disulphide isomerase) to improve the efficiency of IgG folding.

In some embodiments, cell-free expression systems are utilized to co-express the heavy and light chain polypeptides from DNA templates (transcription and translation) or mRNA templates (translation only).

Vectors and Host Cells

Recombinant expression of heavy and light chains requires construction of an expression vector containing a polynucleotide that encodes the heavy or light chain (e.g., antibody, or fusion protein). Once a polynucleotide encoding the heavy or light chain has been obtained, the vector for the production of the heavy or light chain may be produced by recombinant DNA technology using techniques well known in the art. Thus, methods for preparing a protein by expressing a polynucleotide containing the heavy or light chain encoding nucleotide sequence are described herein. Methods that are well known to those skilled in the art can be used to construct expression vectors containing heavy or light chain coding sequences and appropriate transcriptional and translational control signals. These methods include, for example, in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. The invention, thus, provides replicable vectors comprising a nucleotide sequence encoding heavy or light chains, operably linked to a promoter.

The expression vector is transferred to a host cell by conventional techniques and the transfected cells are then cultured by conventional techniques to produce the modified heavy or light chains for use in the method of the invention. In specific embodiments the heavy and light chains for use in the method are co-expressed in the host cell for expression of the entire immunoglobulin molecule, as detailed below.

A variety of host-expression vector systems may be utilized to express the modified heavy and light chains. Such host-expression systems represent vehicles by which the coding sequences of interest may be produced and subsequently purified, but also represent cells which may, when transformed or transfected with the appropriate nucleotide coding sequences, express the modified heavy and light chains in situ. These include but are not limited to microorganisms such as bacteria (e.g., E. coli and B. subtilis) transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing the modified heavy and light chain coding sequences; yeast (e.g., Saccharomyces Pichia) transformed with recombinant yeast expression vectors containing modified heavy and light chain coding sequences; insect cell systems infected with recombinant virus expression vectors (e.g., baculovirus) containing modified heavy and light chain coding sequences; plant cell systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression vectors (e.g., Ti plasmid) containing modified heavy and light chain coding sequences; or mammalian cell systems (e.g., COS, CHO, BHK, HEK-293, NSO, and 3T3 cells) harboring recombinant expression constructs containing promoters derived from the genome of mammalian cells (e.g., metallothionein promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the vaccinia virus 7.5K promoter). In certain embodiments, bacterial cells such as Escherichia coli, or eukaryotic cells, are used for the expression of modified heavy and light chains, which is a recombinant antibody or fusion protein molecules. For example, mammalian cells such as Chinese hamster ovary cells (CHO), in conjunction with a vector such as the major intermediate early gene promoter element from human cytomegalovirus is an effective expression system for antibodies (Foecking et al., 1986, Gene 45:101; and Cockett et al., 1990, Bio/Technology 8:2). In a specific embodiment, the expression of nucleotide sequences encoding the immunoglobulin heavy and light chains of each heterodimer is regulated by a constitutive promoter, inducible promoter or tissue specific promoter.

In mammalian host cells, a number of viral-based expression systems may be utilized. In cases where an adenovirus is used as an expression vector, the modified heavy and light chain coding sequences of interest may be ligated to an adenovirus transcription/translation control complex, e.g., the late promoter and tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., region E1 or E3) will result in a recombinant virus that is viable and capable of expressing the modified heavy and light chains in infected hosts (e.g., see Logan & Shenk, 1984, Proc. Natl. Acad. Sci. USA 81:355-359). Specific initiation signals may also be required for efficient translation of inserted antibody coding sequences. These signals include the ATG initiation codon and adjacent sequences. Furthermore, the initiation codon must be in phase with the reading frame of the desired coding sequence to ensure translation of the entire insert. These exogenous translational control signals and initiation codons can be of a variety of origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements, transcription terminators, etc. (see, e.g., Bittner et al., 1987, Methods in Enzymol. 153:516-544).

The expression of the immunoglobulin heavy and light chains of the heterodimers may be controlled by any promoter or enhancer element known in the art. Promoters which may be used to control the expression of the gene encoding modified heavy and light chains (e.g., antibody or fusion protein) include, but are not limited to, the SV40 early promoter region (Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 3′ long terminal repeat of Rous sarcoma virus (Yamamoto, et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78.1441-1445), the regulatory sequences of the metallothionein gene (Brinster et al., 1982, Nature 296:39-42), the tetracycline (Tet) promoter (Gossen et al., 1995, Proc. Nat. Acad. Sci. USA 89:5547-5551); prokaryotic expression vectors such as the p3-lactamase promoter (Villa-Kamaroff et al, 1978, Proc. Natl. Acad. Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:21-25; see also “Useful proteins from recombinant bacteria” in Scientific American, 1980, 242:74-94); plant expression vectors comprising the nopaline synthetase promoter region (Herrera-Estrella et al., Nature 303:209-213) or the cauliflower mosaic virus 35S RNA promoter (Gardner et al., 1981, Nucl. Acids Res. 9:2871), and the promoter of the photosynthetic enzyme ribulose biphosphate carboxylase (Herrera-Estrella et al., 1984, Nature 310:115-120); promoter elements from yeast or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter, alkaline phosphatase promoter, and the following animal transcriptional control regions, which exhibit tissue specificity and have been utilized in transgenic animals: elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984, Cell 38:639-646; Ornitz et al., 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425-515); insulin gene control region which is active in pancreatic beta cells (Hanahan, 1985, Nature 315:115-122), immunoglobulin gene control region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), mouse mammary tumor virus control region which is active in testicular, breast, lymphoid and mast cells (Leder et al., 1986, Cell 45:485-495), albumin gene control region which is active in liver (Pinkert et al., 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene control region which is active in liver (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; Hammer et al., 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is active in the liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene control region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; Kollias et al., 1986, Cell 46:89-94; myelin basic protein gene control region which is active in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 48:703-712); myosin light chain-2 gene control region which is active in skeletal muscle (Sani, 1985, Nature 314:283-286); neuronal-specific enolase (NSE) which is active in neuronal cells (Morelli et al., 1999, Gen. Virol. 80:571-83); brain-derived neurotrophic factor (BDNF) gene control region which is active in neuronal cells (Tabuchi et al., 1998, Biochem. Biophysic. Res. Com. 253:818-823); glial fibrillary acidic protein (GFAP) promoter which is active in astrocytes (Gomes et al., 1999, Braz J Med Biol Res 32(5): 619-631; Morelli et al., 1999, Gen. Virol. 80:571-83) and gonadotropic releasing hormone gene control region which is active in the hypothalamus (Mason et al., 1986, Science 234:1372-1378).

In addition, a host cell strain may be chosen which modulates the expression of the inserted sequences, or modifies and processes the gene product in the specific fashion desired. Expression from certain promoters can be elevated in the presence of certain inducers; thus, expression of the genetically engineered fusion protein may be controlled. Furthermore, different host cells have characteristic and specific mechanisms for the translational and post-translational processing and modification (e.g., glycosylation, phosphorylation of proteins). Appropriate cell lines or host systems can be chosen to ensure the desired modification and processing of the foreign protein expressed. For example, expression in a bacterial system will produce an unglycosylated product and expression in yeast will produce a glycosylated product. Eukaryotic host cells that possess the cellular machinery for proper processing of the primary transcript (e.g., glycosylation, and phosphorylation) of the gene product may be used. Such mammalian host cells include, but are not limited to, CHO, VERY, BHK, Hela, COS, MDCK, HEK-293, 3T3, WI38, NSO, and in particular, neuronal cell lines such as, for example, SK-N-AS, SK-N-FI, SK-N-DZ human neuroblastomas (Sugimoto et al., 1984, J. Natl. Cancer Inst. 73: 51-57), SK-N-SH human neuroblastoma (Biochim. Biophys. Acta, 1982, 704: 450-460), Daoy human cerebellar medulloblastoma (He et al., 1992, Cancer Res. 52: 1144-1148) DBTRG-05MG glioblastoma cells (Kruse et al., 1992, In Vitro Cell. Dev. Biol. 28A: 609-614), IMR-32 human neuroblastoma (Cancer Res., 1970, 30: 2110-2118), 1321 Ni human astrocytoma (Proc. Natl. Acad. Sci. USA, 1977, 74: 4816), MOG-G-CCM human astrocytoma (Br. J. Cancer, 1984, 49: 269), U87MG human glioblastoma-astrocytoma (Acta Pathol. Microbiol. Scand., 1968, 74: 465-486), A172 human glioblastoma (Olopade et al., 1992, Cancer Res. 52: 2523-2529), C6 rat glioma cells (Benda et al., 1968, Science 161: 370-371), Neuro-2a mouse neuroblastoma (Proc. Natl. Acad. Sci. USA, 1970, 65: 129-136), NB41A3 mouse neuroblastoma (Proc. Natl. Acad. Sci. USA, 1962, 48: 1184-1190), SCP sheep choroid plexus (Bolin et al., 1994, J. Virol. Methods 48: 211-221), G355-5, PG-4 Cat normal astrocyte (Haapala et al., 1985, J. Virol. 53: 827-833), Mpf ferret brain (Trowbridge et al., 1982, In Vitro 18: 952-960), and normal cell lines such as, for example, CTX TNA2 rat normal cortex brain (Radany et al., 1992, Proc. Natl. Acad. Sci. USA 89: 6467-6471) such as, for example, CRL7030 and Hs578Bst. Furthermore, different vector/host expression systems may effect processing reactions to different extents.

For long-term, high-yield production of recombinant proteins, stable expression is often preferred. For example, cell lines that stably express the modified heavy and light chains of the invention (e.g., antibody or fusion protein) may be engineered. Rather than using expression vectors that contain viral origins of replication, host cells can be transformed with DNA controlled by appropriate expression control elements (e.g., promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable marker. Following the introduction of the foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched medium, and then are switched to a selective medium. The selectable marker in the recombinant plasmid confers resistance to the selection and allows cells to stably integrate the plasmid into their chromosomes and grow to form foci that in turn can be cloned and expanded into cell lines.

A number of selection systems may be used, including but not limited to the herpes simplex virus thymidine kinase (Wigler et al., 1977, Cell 11:223), hypoxanthine-guanine phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc. Natl. Acad. Sci. USA 48:2026), and adenine phosphoribosyltransferase (Lowy et al., 1980, Cell 22:817) genes can be employed in tk-, hgprt- or aprt-cells, respectively. Also, antimetabolite resistance can be used as the basis of selection for dhfr, which confers resistance to methotrexate (Wigler et al., 1980, Natl. Acad. Sci. USA 77:3567; O'Hare et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers resistance to the aminoglycoside G-418 (Colberre-Garapin et al., 1981, J. Mol. Biol. 150:1); and hygro, which confers resistance to hygromycin (Santerre et al., 1984, Gene 30:147) genes.

Co-Expression of Heavy Chains and Light Chains

The immunoglobulin heavy chains and light chains of the heterodimer pairs according to the invention can be co-expressed in mammalian cells, as noted above. In one embodiment, one heavy chain is co-expressed with two different light chains in a LCCA design set as described above, where the heavy chain preferentially pairs with one of the two light chains. In another embodiment, two heavy chains are co-expressed with two different light chains, where each heavy chain preferentially pairs with one of the light chains.

Testing of Heterodimer Pairs

As described above, at least one heterodimer of the heterodimer pairs according to the invention can comprise one or more amino acid modifications to their immunoglobulin heavy and/or immunoglobulin light chains such that when the two heavy chains and two light chains of the heterodimer pair are co-expressed in a mammalian cell, the heavy chain of the first heterodimer preferentially pairs with one of the light chains rather than the other. Likewise, the heavy chain of the second heterodimer preferentially pairs with the second light chain rather than the first. The degree of preferential pairing can be assessed, for example, by using the methods described below. The affinity of each heterodimer of the heterodimer pair for its respective antigen can be tested as described below. The thermal stability of each heterodimer of the heterodimer pair can also be tested as described below.

Methods to Measure Preferential Pairing

LCCA

In one embodiment, preferential pairing between immunoglobulin heavy and light chains is determined by performing a Light Chain Competition Assay (LCCA). Co-owned patent application PCT/US2013/063306, filed Oct. 3, 2013, describes various embodiments of LCCA and is herein incorporated by reference in its entirety for all purposes. The method allows quantitative analysis of the pairing of heavy chains with specific light chains within the mixture of co-expressed proteins and can be used to determine if one particular immunoglobulin heavy chain selectively associates with either one of two immunoglobulin light chains when the heavy chain and light chains are co-expressed. The method is briefly described as follows: At least one heavy chain and two different light chains are co-expressed in a cell, in ratios such that the heavy chain is the limiting pairing reactant; optionally separating the secreted proteins from the cell; separating the immunoglobulin light chain polypeptides bound to heavy chain from the rest of the secreted proteins to produce an isolated heavy chain paired fraction; detecting the amount of each different light chain in the isolated heavy chain fraction; and analyzing the relative amount of each different light chain in the isolated heavy chain fraction to determine the ability of the at least one heavy chain to selectively pair with one of the light chains.

The method provides reasonable throughput and is robust (i.e. insensitive to minor changes in operation, such as user or flow rate) and accurate. The method provides a sensitive assay that can measure the effects of small variations in the protein sequences. Promiscuous protein-protein; domain-domain; chain-chain interactions over large surface areas usually require multiple mutations (swaps) in order to introduce selectivity. The protein products do not need to be isolated and purified which enables more efficient screening. Further details regarding an embodiment of this method are described in the Examples.

Alternative Methods to Determine Preferential Pairing

Alternative methods for detecting preferential pairing include using LC-MS (Liquid chromatography-Mass spectrometry) to quantify the relative heterodimer populations including each light chain using differences in their molecular weight to identify each distinct species. An antigen activity assay could also be used to quantify relative heterodimer populations containing each light chain whereby the degree of binding measured (relative to controls) would be used to estimate each respective heterodimer population.

Additional methods such as SMCA are described in the Examples, Figs, and Tables.

Thermal Stability

The thermal stability of the heterodimers can be determined according to methods known in the art. The melting temperature of each heterodimer is indicative of its thermal stability. The melting point of the heterodimer may be measured using techniques such as differential scanning calorimetry (Chen et al (2003) Pharm Res 20:1952-60; Ghirlando et al (1999) Immunol Lett 68:47-52). Alternatively, the thermal stability of the heterodimer may be measured using circular dichroism (Murray et al. (2002) J. Chromatogr Sci 40:343-9).

Affinity for Antigen

The binding affinity of the heterodimers for their respective antigens and the off-rate of the interaction can be determined by competitive binding assays according to methods well known in the art. One example of a competitive binding assay is a radioimmunoassay comprising the incubation of labeled antigen (e.g., 3H or 1251 with a molecule of interest (e.g., heterodimers of the present invention) in the presence of increasing amounts of unlabeled antigen, and the detection of the molecule bound to the labeled ligand. The affinity of the heterodimer of the present invention for the antigen and the binding off-rates can be determined from the saturation data by Scatchard analysis.

The kinetic parameters of a heterodimer according to the invention may also be determined using surface plasmon resonance (SPR) based assays known in the art (e.g., BIAcore kinetic analysis). For a review of SPR-based technology see Mullet et al., 2000, Methods 22: 77-91; Dong et al., 2002, Review in Mol. Biotech., 82: 303-23; Fivash et al., 1998, Current Opinion in Biotechnology 9: 97-101; Rich et al., 2000, Current Opinion in Biotechnology 11: 54-61. Additionally, any of the SPR instruments and SPR based methods for measuring protein-protein interactions described in U.S. Pat. Nos. 6,373,577; 6,289,286; 5,322,798; 5,341,215; 6,268,125 are contemplated in the methods of the invention. FACS can also be used to measured affinity, as is known in the art.

Pharmaceutical Compositions

The present invention also provides pharmaceutical compositions comprising the heterodimers or heterodimer pairs described herein. Such compositions comprise a therapeutically effective amount of the heterodimer or heterodimer pair, and a pharmaceutically acceptable carrier. In a specific embodiment, the term “pharmaceutically acceptable” means approved by a regulatory agency of the Federal or a state government or listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use in animals, and more particularly in humans. The term “carrier” refers to a diluent, adjuvant, excipient, or vehicle with which the therapeutic is administered. Such pharmaceutical carriers can be sterile liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. Water is a preferred carrier when the pharmaceutical composition is administered intravenously. Saline solutions and aqueous dextrose and glycerol solutions can also be employed as liquid carriers, particularly for injectable solutions. Suitable pharmaceutical excipients include starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, glycerol, propylene, glycol, water, ethanol and the like. The composition, if desired, can also contain minor amounts of wetting or emulsifying agents, or pH buffering agents. These compositions can take the form of solutions, suspensions, emulsion, tablets, pills, capsules, powders, sustained-release formulations and the like. The composition can be formulated as a suppository, with traditional binders and carriers such as triglycerides. Oral formulation can include standard carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, etc. Examples of suitable pharmaceutical carriers are described in “Remington's Pharmaceutical Sciences” by E. W. Martin. Such compositions will contain a therapeutically effective amount of the compound, preferably in purified form, together with a suitable amount of carrier so as to provide the form for proper administration to the patient. The formulation should suit the mode of administration.

In certain embodiments, the composition comprising the heterodimer or heterodimer pair is formulated in accordance with routine procedures as a pharmaceutical composition adapted for intravenous administration to human beings. Typically, compositions for intravenous administration are solutions in sterile isotonic aqueous buffer. Where necessary, the composition may also include a solubilizing agent and a local anesthetic such as lignocaine to ease pain at the site of the injection. Generally, the ingredients are supplied either separately or mixed together in unit dosage form, for example, as a dry lyophilized powder or water free concentrate in a hermetically sealed container such as an ampoule or sachette indicating the quantity of active agent. Where the composition is to be administered by infusion, it can be dispensed with an infusion bottle containing sterile pharmaceutical grade water or saline. Where the composition is administered by injection, an ampoule of sterile water for injection or saline can be provided so that the ingredients may be mixed prior to administration.

In certain embodiments, the compositions described herein are formulated as neutral or salt forms. Pharmaceutically acceptable salts include those formed with anions such as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and those formed with cations such as those derived from sodium, potassium, ammonium, calcium, ferric hydroxide isopropylamine, triethylamine, 2-ethylamino ethanol, histidine, procaine, etc.

The amount of the composition described herein which will be effective in the treatment, inhibition and prevention of a disease or disorder associated with aberrant expression and/or activity of a therapeutic protein can be determined by standard clinical techniques. In addition, in vitro assays may optionally be employed to help identify optimal dosage ranges. The precise dose to be employed in the formulation will also depend on the route of administration, and the seriousness of the disease or disorder, and should be decided according to the judgment of the practitioner and each patient's circumstances. Effective doses are extrapolated from dose-response curves derived from in vitro or animal model test systems.

Uses of Heterodimer Pairs

As described above, the heterodimer pairs according to the invention can comprise a first heterodimer and a second heterodimer, where the immunoglobulin heavy chain and the immunoglobulin light chain of each heterodimer is derived or engineered from a known therapeutic antibody or from a known antibody that binds a molecule. Thus, it is contemplated that heterodimers derived or engineered from these antibodies could be used for the treatment or prevention of the same disease, disorder, or infection that the known therapeutic antibody or known antibody can be used for.

Thus, in one embodiment, heterodimer pairs according to the invention that comprise a heterodimer with heavy and light chains derived from a therapeutic antibody that can be used for the treatment and/or prevention of cancer and related disorders, can also be used for the treatment and/or prevention of cancer and related disorders.

In another embodiment, heterodimer pairs according to the invention that comprise a heterodimer with heavy and light chains derived from a therapeutic antibody that can be used for preventing, treating, or managing the symptoms of an inflammatory disorder in a subject, can also be used for preventing, treating, or managing the symptoms of an inflammatory disorder in a subject.

In another embodiment, heterodimer pairs according to the invention that comprise a heterodimer with heavy and light chains derived from a therapeutic antibody that can be used for the treatment or prevention of autoimmune disease or inflammatory disease in a subject, can also be used for the treatment or prevention of autoimmune disease or inflammatory disease in a subject.

In another embodiment, heterodimer pairs according to the invention that comprise a heterodimer with heavy and light chains derived from a therapeutic antibody that can be used for the treatment or prevention of an infectious disease in a subject, can also be used for the treatment or prevention of an infectious disease in a subject.

In another embodiment, heterodimer pairs according to the invention that comprise a heterodimer with heavy and light chains derived from a therapeutic antibody that can be used for the treatment of vascular disease in a subject, can also be used for the treatment of vascular disease in a subject.

In another embodiment, the heterodimer pairs according to the invention may also be advantageously utilized in combination with other therapeutic agents known in the art for the treatment or prevention of a cancer, autoimmune disease, inflammatory disorders or infectious diseases. In a specific embodiment, the heterodimer pairs according to the invention may be used in combination with monoclonal or chimeric antibodies, lymphokines, or hematopoietic growth factors (such as, e.g., IL-2, IL-3 and IL-7), which, for example, serve to increase the number or activity of effector cells which interact with the molecules and, increase immune response. The heterodimer pairs according to the invention may also be advantageously utilized in combination with one or more drugs used to treat a disease, disorder, or infection such as, for example anti-cancer agents, anti-inflammatory agents or anti-viral agents.

Kits

The present invention additionally provides for kits comprising one or more heterodimer pairs. Individual components of the kit would be packaged in separate containers and, associated with such containers, can be a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products, which notice reflects approval by the agency of manufacture, use or sale. The kit may optionally contain instructions or directions outlining the method of use or administration regimen for the heterodimer pairs.

When one or more components of the kit are provided as solutions, for example an aqueous solution, or a sterile aqueous solution, the container means may itself be an inhalant, syringe, pipette, eye dropper, or other such like apparatus, from which the solution may be administered to a subject or applied to and mixed with the other components of the kit.

The components of the kit may also be provided in dried or lyophilized form and the kit can additionally contain a suitable solvent for reconstitution of the lyophilized components. Irrespective of the number or type of containers, the kits of the invention also may comprise an instrument for assisting with the administration of the composition to a patient. Such an instrument may be an inhalant, nasal spray device, syringe, pipette, forceps, measured spoon, eye dropper or similar medically approved delivery vehicle.

Computer Implementation

In one embodiment, a computer comprises at least one processor coupled to a chipset. Also coupled to the chipset are a memory, a storage device, a keyboard, a graphics adapter, a pointing device, and a network adapter. A display is coupled to the graphics adapter. In one embodiment, the functionality of the chipset is provided by a memory controller hub and an I/O controller hub. In another embodiment, the memory is coupled directly to the processor instead of the chipset.

The storage device is any device capable of holding data, like a hard drive, compact disk read-only memory (CD-ROM), DVD, or a solid-state memory device. The memory holds instructions and data used by the processor. The pointing device may be a mouse, track ball, or other type of pointing device, and is used in combination with the keyboard to input data into the computer system. The graphics adapter displays images and other information on the display. The network adapter couples the computer system to a local or wide area network.

As is known in the art, a computer can have different and/or other components than those described previously. In addition, the computer can lack certain components. Moreover, the storage device can be local and/or remote from the computer (such as embodied within a storage area network (SAN)).

As is known in the art, the computer is adapted to execute computer program modules for providing functionality described herein. As used herein, the term “module” refers to computer program logic utilized to provide the specified functionality. Thus, a module can be implemented in hardware, firmware, and/or software. In one embodiment, program modules are stored on the storage device, loaded into the memory, and executed by the processor.

It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims.

Examples

Below are examples of specific embodiments for carrying out the present invention. The examples are offered for illustrative purposes only, and are not intended to limit the scope of the present invention in any way. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperatures, etc.), but some experimental error and deviation should, of course, be allowed for.

The practice of the present invention will employ, unless otherwise indicated, conventional methods of protein chemistry, biochemistry, recombinant DNA techniques and pharmacology, within the skill of the art. Such techniques are explained fully in the literature. See, e.g., T. E. Creighton, Proteins: Structures and Molecular Properties (W.H. Freeman and Company, 1993); A. L. Lehninger, Biochemistry (Worth Publishers, Inc., current addition); Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); Remington's Pharmaceutical Sciences, 18th Edition (Easton, Pa.: Mack Publishing Company, 1990); Carey and Sundberg Advanced Organic Chemistry 3rd Ed. (Plenum Press) Vols A and B (1992).

Example 1: Preparation of Constructs Encoding D3H44 IgG Heavy Chains and D3H44 IgG Light Chains

The wild-type heavy and light chains of the anti-tissue factor antibody D3H44 for use in the co-expression sets described herein were prepared as follows. D3H44 Fab light (AJ308087.1) and heavy (AJ308086.1) chain sequences were taken from GenBank (Tables A, A1, and A2), gene synthesized and codon optimized for mammalian expression. Light chain vector inserts, consisting of a 5′-EcoRI cutsite-HLA-A signal peptide-HA or FLAG tag-Light chain Ig clone-‘TGA stop’-BamH1 cutsite-3′, were ligated into a pTT5 vector (Durocher Y et al., Nucl. Acids Res. 2002; 30, No. 2 e9). The resulting vector+insert were sequenced to confirm correct reading frame and sequence of the coding DNA. Likewise, heavy chain vector inserts, consisting of a 5′-EcoRlcutsite-HLA-A signal peptide-heavy chain clone (terminating at T238; see Table A1)-ABD2-His6tag-TGA stop-BamH1 cutsite-3′ (“His6” disclosed as SEQ ID NO: 2), were ligated into a pTT5 vector (ABD; albumin binding domain). The resulting vector+insert were also sequenced to confirm correct reading frame and sequence of the coding DNA. The various D3H44 constructs were generated either by gene synthesis or by site-directed mutagenesis (Braman J, Papworth C & Greener A., Methods Mol. Biol. (1996) 57:31-44).

Heavy and light chains were tagged at the C- and N-terminals respectively, in order to facilitate the assessment of preferential pairing via a competition assay-SPR screen. The ABD2-His₆ heavy chain tag (“His₆” disclosed as SEQ ID NO: 2) specifically allowed HC-LC complexes to be captured on an anti-his tag SPR chip surface, whilst FLAG and HA light chain tags allowed the relative LC1 and LC2 populations to be quantified.

Example 2: Assessment of Preferential Pairing of Heterodimers in Co-Expression Sets Comprising Variable Domain Modifications in D3H44 IgG Light and/or Heavy Chains

The ability of heterodimers to preferentially pair in co-expression sets comprising D3H44 heavy and light chains with modified V_(L) and/or V_(H) domains was determined and the results are shown in FIG. 1. The results provided in FIG. 1 are preliminary and a more complete set of results is provided below. The amino acid modifications shown in FIGS. 1-3 are identified with reference to the amino acid sequence of D3H44 heavy chain and D3H44 light chain. See Tables A, A1, and A2.

One D3H44 heavy chain construct was co-expressed with two unique D3H44 light chain constructs and the relative light chain pairing specificity (e.g. H1-L1:H1-L2) was determined from a competition assay-SPR screen (Column entitled “Competition assay screen results” in FIG. 1). Selected heterodimer hits were verified via a light chain competition assay verification whereby L1:L2 DNA ratios were varied by 40:60, 50:50 and 60:40 during transfection (Column entitled “Competition assay verification results” in FIG. 1). Heavy chain were kept in limiting quantities (i.e. HC<L1+L2) for both competition assay screens and verifications. A schematic representing the design of the assay is shown in FIG. 8.

The methods were carried out as follows: The Light Chain Competition Assay (LCCA) quantifies the relative pairing of one heavy chain for at least two unique light chains. The assay and the preceding steps can be summarized as follows: 1. Concomitant expression of heavy and light chains, with the heavy chain being in limiting amounts (e.g. HC:LC1:LC2=1:1:1), 2. Isolation of HC-LC complexes—achieved by binding heavy chains to the SPR chip via a his-tag pull-down, and 3) Quantification of relative HC-LC populations (i.e. H1-L1:H1-L2). In the SPR format, antibodies specific for unique light chain-tagged populations are used for the quantification. Note: This assay can be carried out with or without the H-L disulphide. A schematic diagram representing the method is shown in FIG. 9.

Transfection Method

Co-expression sets comprising one heavy chain and two light chain constructs prepared as described in Example 1 were transfected into CHO-3E7 cells as follows. CHO-3E7 cells, at a density of 1.7-2×10⁶cells/ml, were cultured at 37° C. in FreeStyle™ F17 medium (Invitrogen cat# A-1383501) supplemented with 4 mM glutamine and 0.1% Pluronic F-68 (Invitrogen cat#24040-032). A total volume of 2 ml were transfected with a total of 2 ug DNA using PEI-pro (Polyplus cat#115-010) at a DNA:PEI ratio of 1:2.5. Twenty-four hours after the addition of the DNA-PEI mixture, the cells were transferred to 32° C. Supernatants were tested for expression on day 7 by non-reducing SDS-PAGE analysis followed by Coommassie blue staining to visualize the bands. HC: LC ratios are as indicated in Table 7.

TABLE 7 DNA quantity used for transfection (ng) HC:L1:L2 Stuffer{circumflex over ( )} ratio Experiment HC LC1 LC2 DNA 1:1:1 Competition 333 333 333 1000 assay screen 1:1:1 Competition 333 333 333 1000 assay verification 1:0.8:1.2 Competition 333 266 400 1000 assay verification 1:1:1 Competition 333 333 333 1000 assay verification 1:1.2:0.8 Competition 333 400 266 1000 assay verification {circumflex over ( )}Stuffer DNA: _(P)TT5 vector without a DNA insert.

Competition Assay SPR Method

The degree of preferential D3H44 light chain pairing to D3H44 heavy chain in co-expression sets was assessed using an SPR-based readout of unique epitope tags located at the N-terminus of each light chain.

Surface Plasmon Resonance (SPR) Supplies.

GLM sensorchips, the Biorad ProteOn amine coupling kit (1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDC), N-hydroxysulfosuccinimide (sNHS) and ethanolamine), and 10 mM sodium acetate buffers were purchased from Bio-Rad Laboratories (Canada) Ltd. (Mississauga, ON). Recombinant Her-2 protein was purchased from eBioscience (San Diego, Calif.). 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES) buffer, ethylenediaminetetraacetic acid (EDTA), and NaCl were purchased from Sigma-Aldrich (Oakville, ON). 10% Tween 20 solution was purchased from Teknova (Hollister, Calif.).

SPR Biosensor Assays.

All surface plasmon resonance assays were carried out using a BioRad ProteOn XPR36 instrument (Bio-Rad Laboratories (Canada) Ltd. (Mississauga, ON)) with PBST running buffer (PBS Teknova Inc with 0.05% Tween20) at a temperature of 25° C. The anti-penta His (SEQ ID NO: 3) capture surface was generated using a GLM sensorchip activated by a 1:5 dilution of the standard BioRad sNHS/EDC solutions injected for 140 s at 100 μL/min in the analyte (horizontal) direction. Immediately after the activation, a 25 μg/mL solution of anti-penta His antibody (SEQ ID NO: 3) (Qiagen Inc.) in 10 mM NaOAc pH 4.5 was injected in the analyte (vertical) direction at a flow rate of 25 μL/min until approximately 3000 resonance units (RUs) were immobilized. Remaining active groups were quenched by a 140 s injection of 1M ethanolamine at 100 μL/min in the analyte direction, and this also ensures mock-activated interspots are created for blank referencing.

The screening of the heterodimers for binding to the anti-FLAG (Sigma Inc.) and anti-HA (Roche Inc.) monoclonal antibodies occurred in two steps: an indirect capture of the heterodimers onto the anti-penta His surface (SEQ ID NO: 3) in the ligand direction followed by an anti-FLAG and anti-HA injection in the analyte direction. First, one buffer injection for 30 s at 100 uL/min in the ligand direction was used to stabilize the baseline. For each heterodimer capture, unpurified heterodimers in cell-culture media were diluted to 4% in PBST. One to five heterodimers or controls (i.e. controls containing either 100% HA-light chain or 100% FLAG-light chain) were simultaneously injected in individual ligand channels for 240 s at flow 25 L/min. This resulted in a saturating heterodimer capture of approximately 300 to 400 RUs onto the anti-penta His surface (SEQ ID NO: 3). The first ligand channel was left empty to use as a blank control if required. This heterodimer capture step was immediately followed by two buffer injections in the analyte direction to stabilize the baseline, and then 5 nM anti-FLAG and 5 nM anti-HA were each injected in duplicate at 50 μL/min for 120 s with a 180 s dissociation phase, resulting in a set of binding sensorgrams with a buffer reference for each of the captured heterodimer. Where possible, the antigen to which the heterodimer binds can also be injected over the last remaining analyte channel as an activity control. The heterodimers were regenerated by an 18 s pulse of 0.85% phosphoric acid for 18 s at 100 μL/min to prepare the anti-penta His surface (SEQ ID NO: 3) for the next injection cycle. Sensorgrams were aligned and double-referenced using the buffer blank injection and interspots, and the resulting sensorgrams were analyzed using ProteOn Manager software v3.0.

The total percentage of L1 and L2 should, theoretically, add up to 100%. In practice, it was observed for some variants that the total amount of L1 and L2 added up to significantly less than 100%. This discrepancy in total light chain percentage is believed to be due in part to the occurrence of variable non-specific binding during initial heterodimer capture on the SPR chip.

Example 3: Assessment of Preferential Pairing of Heterodimers in Co-Expression Sets Comprising Constant (CL or CH1) Domain Modifications in D3H44 IgG Light and/or Heavy Chains

The ability of heterodimers to preferentially pair in co-expression sets comprising D3H44 heavy and light chains with modified CL and/or CH1 domains was determined as described for heterodimers with variable domain modifications in Example 2, and the results are shown in FIG. 2. One D3H44 heavy chain construct was co-expressed with two unique D3H44 light chain constructs and the relative light chain pairing specificity (e.g. H1-L1:H1-L2) was determined from a competition assay-SPR screen (Column entitled “Competition assay screen results” in FIG. 2). Selected heterodimer hits were confirmed via a modified competition assay verification where DNA ratios of L1:L2 were varied by 40:60, 50:50 and 60:40 during transfection (Column entitled “Competition assay verification results” in FIG. 2). As described in Example 2, heavy chain was kept in limiting quantities (i.e. HC<L1+L2) for both competition assay screens and verifications. Assessment of preferential pairing was carried out as described in Example 2.

Example 4: Scale Up for Biophysical Characterization

Selected heterodimers, both paired and mispaired, were scaled up (typically to 50 ml) and purified as follows in order to test for thermal stability and antigen binding. Heterodimers HD100-HD115, as shown in FIG. 3 were expressed and purified. The heavy and light chain of each heterodimer was expressed in 50 ml cultures of CHO-3E7 cells under the culture conditions described above. Cells were centrifuged and heterodimers purified by loading the supernatant on Fractogel column charged with Nickel as described below.

Purification on Fractogel Column Charged with Nickel (His)

Charging the column with Nickel: Sequentially wash with 5 column volumes (CV) of 0.5 M NaCl (no pH adjustment), followed by 4 CV 200 mM of NiCl₂ (Nickel) and 2 CV of 0.5 M NaCl pH 5.0. Sample loading and elution: Equilibrate column with 10 CV PBS. Load sample and wash with 10 CV of wash buffer #1 (50 mM sodium phosphate pH 7.0, 300 mM NaCl) followed by 10 CV of wash buffer #2 (50 mM sodium phosphate pH 7.0, 300 mM NaCl, 25 mM Imidazole) to remove impurities bound to the column. The heterodimers were eluted in fractions with wash buffer #1+300 mM Imidazole. The protein content of each fraction was tested by Bradford protein assay. Fractions containing protein were pooled. The purified heterodimers were then assayed for antigen binding and thermal stability as described in Example 5.

Example 5: Thermal Stability and Antigen Affinity Measurements of Heterodimers

The thermal stability and antigen affinity of selected heterodimer pairs was measured in order to compare these features with that of wild type, unmodified heavy chain-light chain pair. Correctly paired and mispaired heterodimers from co-expression sets were individually scaled up, purified (i.e. His tag affinity purification) and assessed for thermal stability and antigen binding as described below. The results are shown in FIG. 3.

Measurement of Thermal Stability

The thermal stability of selected heterodimer pairs was measured using differential scanning calorimetry (DSC) as follows.

Each heterodimer was purified as described in Example 3 and diluted to 0.2 mg/mL in PBS, and a total of 400 μL was used for DSC analysis with a VP-Capillary DSC (GE Healthcare). At the start of each DSC run, 5 buffer blank injections were performed to stabilize the baseline, and a buffer injection was placed before each heterodimer injection for referencing. Each sample was scanned from 20 to 100° C. at a 60° C./hr rate, with low feedback, 8 sec filter, 5 min preTstat, and 70 psi nitrogen pressure. The resulting thermograms were referenced and analyzed using Origin 7 software.

Thermal unfolding curves for the heterodimers tested are shown in FIG. 4. The results indicate that the correctly paired heterodimer (from a design perspective) is usually significantly more stable than the intended mispaired heterodimer (e.g. HD107 versus HD108). In addition, many of the correctly paired heterodimer exhibit a thermal stability close to wild-type Fab (e.g. HD 114).

Measurement of Antigen Affinity

The affinity of the heterodimer pairs for antigen (tissue factor extracellular domains) was measured using surface plasmon resonance (SPR) assays. All surface plasmon resonance assays were carried out using a BioRad ProteOn XPR36 instrument (Bio-Rad Laboratories (Canada) Ltd. (Mississauga, ON) with PBST running buffer (PBS Teknova Inc with 0.05% Tween20) at a temperature of 25° C. A purified tissue factor (TF) surface was generated using a GLM sensorchip activated by a 1:10 dilution of the standard BioRad sNHS/EDC solutions injected for 140 s at 100 μL/min in the ligand (vertical) direction. Immediately after the activation, a 25 μg/mL solution of TF in 10 mM NaOAc pH 4.5 was injected in the ligand direction at a flow rate of 25 μL/min until approximately 1000 resonance units (RUs) were immobilized (or enough for a 100 RU maximum response when flowing 60 nM FAB). Remaining active groups were quenched by a 140 s injection of 1M ethanolamine at 100 μL/min in the analyte direction. For each injection series, two buffer blank injections in the horizontal injection preceded the purified heterodimer. A 3-fold dilution series of each heterodimer (60 nM, 20 nM, 6.7 nM, 2.2 nM) with a blank buffer control was simultaneously injected at 50 μL/min for 120 s with a 20 minute dissociation, resulting in a set of binding sensorgrams with a buffer reference for each of the heterodimers. The heterodimer:TF complexes on the SPR surface were regenerated by an 18 s pulse of 0.85% phosphoric acid for 18 s at 100 μL/min to prepare the TF surface for the next injection cycle. Sensorgrams were aligned and double-referenced using the buffer blank injection and interspots, and the resulting sensorgrams were analyzed using a 1:1 binding model within the ProteOn Manager software v3.0.

The results indicate that the correctly paired heterodimer (from a design perspective) exhibits a range of affinities for antigen, with some designs showing wild-type like binding affinity for antigen (e.g. HD107 and HD 114).

Example 6: Size Exclusion Chromatography (SEC) Profiles of Wild-Type Tagged D3H44 Heterodimers and a Representative Sample of Individual Preferentially Paired Heterodimers

Wild-type D3H44 heterodimer (one heavy chain and one light chain) with a C-terminus ABD2-His₆ tag (“His₆” disclosed as SEQ ID NO: 2) on the heavy chain and an N-terminus FLAG tag on the light chain were expressed and purified according to methods known in the art and similar to those described in Examples 1 and 4. Preferentially or correctly paired heterodimers from co-expression sets (heterodimers HD100, HD105, and HD107, shown in FIG. 3) were individually scaled up and purified via His tag affinity purification as described in Example 4 and SEC.

SEC was carried out as follows. Heterodimer samples were separated using a Superdex 200 HR 10/30 Pharmacia (GE Healthcare) column mounted on a Pharmacia (GE Healthcare) ÄKTA Purifier system. Heterodimer samples (0.3-0.5 ml) in PBS were manually loaded into a 0.5 ml loop filled with PBS. Samples were than automatically injected onto the column and resolved at 0.5 ml/min with a 1 CV elution volume. Protein elution was monitored at OD₂₈₀ and collected in 1 ml fractions.

As shown in FIG. 5, correctly paired heterodimers displayed SEC profiles close to that observed for wild-type heterodimer without amino acid modifications (Main peak [*]:heterodimer). Equivalent results are obtained when the light chain of the wild-type heterodimer has a N-terminus HA tag.

Example 7: Additional Data Relating to the Stability of Heterodimers

Designs shown in Table 8 were highlighted as combinations of design drivers with improved HC-LC selectivities.

TABLE 8 H1_ L1_ H2_ L2_ Design Set # Set # mutation mutation mutation mutation 11 C525 C526 S188L_ V133S F174V_ S176L V190Y P175S_ S188G 12 V042 V043 V37E_ L89R_ WT WT F100D F98W 13 C532 C533 L143A_ Q124E_ K145T_ V133A_ D144G_ V133W_ Q179D_ Q160K_ Q179R Q160E_ S188F T178R T180E 14 D146G_ Q124E_ K145E_ Q124R_ S186R Q160E_ D146G_ Q160K_ T178D Q179D_ T178R S188L 15 C530 C531 D146G_ Q124E_ K145T_ Q160K_ Q179R Q160E_ Q179D_ T178R T178D S188L

Residue numbering follows Kabat nomenclature (Kabat E. A. et al., (1983) Sequence of Proteins of Immunological Interest National Institutes of Health, Bethesda).

The majority of the designs retained wild-type like thermal stability (Tm) and TF binding affinity as shown in Table 9.

TABLE 9 TF Binding Tm (° C.) KD (nM) H1 Readout H2 Readout Design H1-L1 Tm H2-L2 Tm H1-L1 H2-L2 H1-L1 H1-L2 H2-L2 H2-L1 11 74.7 73.3 0.043 0.040 83 24 98 1 12 76.2 76.0 0.052 99 10 88 15 13 67.5 71.5 0.087 0.086 104 1 87 14 14 84 1 74 2 15 70.1 76.3 0.082 0.071 93 1 83 24

Design 12 contains a wild-type H2-L2 pairing (see Table 8). As a result, H2-L2 Fab retains wild-type binding affinity. Wild-type anti-TF D3H44 Fab Tm=˜76° C. (data not shown).

Example 8: Additional Heterodimers and Testing of Same

Additional heterodimer pairs as described in Table 10 were prepared and tested. These heterodimers were designed to increase Fab hotspot coverage.

TABLE 10 Design H1_mutation* L1_mutation H2_mutation L2_mutation 26 A141G_V185A L135W A141W_K147Y_Q175E F116A_S131K_L135A 27 A141I_K147T_D148G_Q175E_(—) F116A_V133G_S176F_T178A S181K_S183H_V185G F118W_Q124E_V133S_S176A_(—) S183G_V185S T178S_T180E 28 A141V_K147L_Q175E_(—) F116A_S131K_V133G_(—) A141W_S181K_S183A F118W_V133S_S176A_T180E S183G_V185S S176F_T178A 29 L145K_D148G Q124E_V133D L145E_K147T Q124R 30 D148G_Q175K Q124E_Q160E_T180E L145E_K147T Q160K_T178R 31 Q39D_A141G_V185A Q38R_L135W Q39R_A141W Q38D_F116A_L135A 32 Q39E Q38R Q39R Q38E_F98W *Residue numbering in Table 10 follows the convention used for residues in the crystal structure of D3H44 Fab (PDB ID = 1JPT [Faelber K et al., J. Mol. Biol. (2001) 313: 83-97]; www.resb.org/pdb/explore/explore.do?structureId=1JPT).

The stability, ability to bind to target, and the ability to selectively pair for these heterodimers was determined as described in Example 5 and are shown in Table 11.

TABLE 11 TF Binding Tm (° C.) KD (nM) H1 Readout H2 Readout Design H1-L1 Tm H2-L2 Tm H1-L1 H2-L2 H1-L1 H1-L2 H2-L2 H2-L1 26 67.8 73.6 92 1 106 1 27 71.0 67 1 85 12 28 85 4 94 1 29 64.1 68.3 93 1 102 1 30 67.0 75.5 0.024 0.060 96 1 102 1 31 74.3 0.025 106 1 78 1 32 98 1 93 1

Wild-type anti-TF D3H44 Fab KD=0.052 nM; Wild-type anti-TF Fab Tm=˜76° C.

Example 9: Additional Heterodimers

The following heterodimer pairs were also prepared and tested for their ability to selectively pair.

Residue numbering in Table 12 follows the convention used for residues in the crystal structure of D3H44 Fab (PDB ID⁼1JPT [Faelber K et al., J. Mol. Biol. (2001) 313:83-97]; www.rcsb.org/pdb/explore/explore.do?structureId=1JPT).

TABLE 13 Design H1_mutation L1_mutation H2_mutation L2_mutation 4691 S181R Q124E_Q160E_T180E K147L_Q175E S131K 4686 D148G_Q175K Q124E_Q160E_T180E L145E_K147T Q124R_Q160K_T178R 5838 Q39E_S181R Q38R_Q124E_Q160E_T180E Q39R_K147L_Q175E Q38E_S131K 5826 Q39E_S181R Q38R_Q160E_T180E Q39R_K147T_Q175E Q38E_S131K Design 1 L145K_D148G Q124E_V133D L145E_K147T Q124R Design 2 D148G_Q175K Q124E_Q160E_T180E L145E_K147T Q124R_Q160K_T178R Design 3 S181R Q124E_Q160E_T178D K147L_Q175E S131K K147L_Q175E S131K D148G_Q175K Q124E_Q160E_T180E Design 4 V37I_Q39R Q38D_F98W V37W_Q39E Q38R_F98A Design 5 Q39R Q38E V37W_Q39E Q38R_F98A Design 6 V37I_Q39D Q38R_F98W V37W_Q39R_W107F Q38E_F98L

Residue numbering in Table 13 follows the convention used for residues in the crystal structure of D3H44 Fab (PDB ID=1JPT [Faelber K et al., J. Mol. Biol. (2001) 313:83-97]; www.rcsb.org/pdb/explore/explore.do?structureId=1JPT)

Example 10: Assessment of Preferential Pairing of Heterodimers in Co-Expression Sets Comprising Constant Domain and/or Variable Domain Modifications in D3H44 Heavy and Light Chain Fab Format

Co-expression sets in addition to those shown in FIGS. 1 and 2 were designed. Constructs encoding the D3H44 IgG heavy and light chains in Fab format comprising amino acid modifications according to the design of the co-expression set were prepared as described in Example 1. The ability of the D3H44 heavy and light chain Fab pairs to preferentially pair was assessed as described in Examples 2 and 3. The stability and binding affinity of the designs was determined as described in Example 5. The results shown in Tables 14 and 15 are cumulative and include results for designs shown in FIGS. 1 and 2 in addition to new designs. The amino acid modifications shown in these tables are identified with reference to the amino acid sequence of D3H44 heavy chain and D3H44 light chain. See Tables A, A1, and A2.

Note that “Design” or “Design set” in this application is referring to a set of mutations on H1, L1, H2, and L2 chains. “LCCA design” refers to set of mutations in H1, L1 and L2.

Each unique set of H1, L1 and L2 mutations (LCCA format) was assigned a unique number, or so called ‘unique identifier’. When data is presented in H1 L1 H2 L2 format (Fab pair format or SMCA), such a design set is consequently denoted with a ‘unique identifier set’ comprised of unique identifiers for the two constituent LCCAs (e.g. 1-2). Designs featured in the D3H44 LCCA data set were assigned first. Designs that are not present in this set, but are present in different homogeneous or mixed system data or/and in different formats (MCA) are in addition denoted with *. In this exercise of assigning unique identifiers, via automatic processing of numerous tables, some redundancy has arisen. Cases where different WT amino acids in systems other than D3H44 occupy the same position are: 309*=319*=47, 316*=101, 317*=183, 318*=182, 310*=48, 311*=102, 323*=180, 324*=179. Cases where additional mutations were incorporated for LC/MS are: 442*=326*, and 443*=23.

Note that the majority of LCCA experiments were performed on constructs lacking interchain Fab disulfide bond(s) located in the constant domain (H/C233S-L/C214S).

In Table 14 provided are designs that exhibit correct pairing specificity of 55%: 45% (H1-L1:H1-L2 and H2-L2:H2-L1) or greater. For the purposes of highlighting a particular design's success with respect to preferential pairing, two complementary LCCA sets (H1, L1, L2 and H2, L2, L1) are represented in a pair Fab format.

Presence of tags (L: HA and FLAG and H: ABD2) does not affect the expected neutral pairing of ˜50%: 50% for D3H44 WT (this is further supported by pairing results for the actual designs; hence tag information is not included in this table).

In the table, the measured amount of relevant Fab species (H1-L1, H1-L2 and that of H2-L2, H2-L1) was included in ratio format (H1-L1:H1-L2 and H2-L2:H2-L1). In the majority of cases, several LCCA experimental repeats were performed (screening and verification). A summary column in the format of a normalized ratio (i.e. to 100% H1-L1 and H1-L2 sum) for the median H1-L1:H1-L2 and H2-L2:H2-L1 is also provided.

Data was clustered according to thermal stability data (Tm) and antigen affinity (in table buckets, the ‘TF’ notation is used for tissue factor affinity) categories:

Tm1 (x=>71° C.); Tm2 (71° C.>x=>66° C.); Tm3 (66° C.>x). Tm3 category also includes ND (experiment not performed) cases.

For reference, the Tm of D3H44 WT Fab (without disulfide bond) is ˜76° C.

TF1 (x=<5×KD of WT median value); TF2 (5<x=<20×KD of WT median value); and miscellaneous category that includes cases of x>20×KD of WT median value where NB cases (no binding: for KD greater than 500 nM, which is ˜10000×KD of WT median) were labeled separately. This last category also includes ND cases (experiment not performed).

For reference, the median KD of D3H44 WT Fab is 0.06 nM (with a range of 0.1).

Note: In Table columns referring to antigen affinity and thermal stability, range (min-max reading) is indicated if the number of experiments performed was greater than 1 (n>1).

Within each bucket, designs are ordered in descending pairing specificity of H1-L1:H1-L2 followed by that of H2-L2:H2-L1.

An example of reading bucket categories in the table:

Tm1 only (both H1-L1 and H2-L2 Tm belong to Tm1 category)

Tm1/Tm2 (H1-L1 or H2-L2 Tm belongs to Tm1 and the other to Tm2 category)

The same logic applies to ‘TF’ categories.

An additional set of LCCA results (Table 15), also presented in the Fab pair format, obtained following an additional design cycle is included in a separate table. This set of data is arranged in the order of decreasing pairing specificity and contains somewhat limited data with respect to thermal stability.

Results in Table 14 and 15 demonstrate that our in silico design approach led to achievement of preferential pairing of H1-L1 over H1-L2 and that of H2-L2 over H2-L1 across a diverse set of designs and their variations. These designs generally fell into two main categories: electrostatic (based on specificity drivers that utilize hydrogen bonding or charge-charge interactions) and steric complementarity. Specificity of such pairing ranged from moderate to 100% correct pairing for both LCCA designs. As evident from the table, some of these designs did not impact thermal stability (Tm) or antigen binding affinity, while some exhibited various degrees of impact on these two properties. Furthermore, successful designs were present in both constant and variable domain, as well as in domain design combination formats.

Example 11: Assessment of Preferential Pairing of Heterodimers in Co-Expression Sets Comprising Constant Domain and/or Variable Domain Modifications in Mixed Ab or Pure Ab Heavy and Light Chain Fab Format

Certain designs described in the previous examples were tested in a system where the heterodimer pairs were derived from a different Ab (cf. to D3H44) or two different antibodies, to assess whether the design of the co-expression set resulted in preferential pairing in these types of systems. A number of different systems were tested. In one example, one heterodimer pair was derived from D3H44 heavy and light chains in the Fab format and the second heterodimer pair was derived from pertuzumab heavy and light chains in the Fab format. In another example, one heterodimer pair was derived from D3H44 heavy and light chains in the Fab format and the second heterodimer pair was derived from trastuzumab heavy and light chains in the Fab format.

Constructs encoding the D3H44 IgG, pertuzumab, and trastuzumab heavy and light chains in Fab format comprising amino acid modifications according to the design of the co-expression set were prepared as described in Example 1. The base DNA sequence for the heavy chain of pertuzumab, the base DNA sequence for the light chain of pertuzumab, the base DNA sequence for the heavy chain of trastuzumab, and the base DNA sequence for the light chain of trastuzumab are shown in Tables A, A1, and A2. Amino acid modifications were introduced into these sequences by site directed mutagenesis, or the DNA sequences were synthesized including the amino acid modifications from the base sequences as described in Example 1.

The ability of the heterodimer designs to preferentially pair was assessed as described in Examples 2 and 3, except for the fact that when mixed systems were tested the non-obligate chain employed belonged to a different Fab.

The results are shown in Table 16. The amino acid modifications shown in Table 16 are identified with reference to the amino acid sequence of D3H44 heavy chain and D3H44 light chain; pertuzumab heavy chain and pertuzumab light chain; trastuzumab heavy chain and trastuzumab light chain. See Tables A, A1, and A2.

A representative and diverse subset of designs, that exhibited successful preferential pairing in D3H44 system, were tested in different systems (Trastuzumab (TRAS) and Pertuzumab (PERT), as well as in mixed systems (D3H44/TRAS, D3H44/PERT and TRAS/PERT) (as with D3H44 LCCA, constructs lacked Fab disulfide).

Data is presented in both LCCA (Table 16) and Fab pair formats (Table 17). LCCA data reflects the minimal ‘competition unit’ (i.e. H1-L1:H1-L2) and is the optimal format for interpreting whether an LCCA design can successfully transfer across Fabs. Analysis in the Fab pair format including the second Fab pair (i.e. H2-L2:H2-L1) further illustrated the degree of translation into whole design (i.e. H1-L1 and H2-L2) and its efficacy in these different Fab systems. Apart from the ratio of H1-L1:H1-L2, we present the relative propensity for correct pairing relative to incorrect pairing as a scalar, where Scalar=ln(H1-L1:H1-L2).

In some of the mixed systems, an inherent cross-system pairing preference (e.g. H_D3H44 pairs preferentially with L_PERT than L_D3H44) was observed (Table 17). In the same example, H_D3H44 preferential pairing with L_PERT was also supported by thermal stability measurements (Tm), which indicated that H_D3H44-L_PERT species was more stable than H_D3H44-L_D3H44. Hence, along with reporting the actual species amounts measured, the normalized data to respective WT LCCA system (REF) behavior is presented in the form of ΔScalar(VAR-REF_WT) where ΔScalar=ln (H1-L1:H1-L2/H2-L2:H2-L1). This metric is an indicator of the actual effectiveness of the LCCA design. Data included in the tables comprise designs that yielded an equivalent of 55%: 45% or greater paring specificity (i.e. ΔScalar (VAR-REF_WT)>0.2).

Unlike in the D3H44 system, certain species ratios appear to be at times affected by the light chain tag in WT PERT only and WT TRAS (to a lesser degree) systems (Table 18).

This appears to be due to random events of HA-tag cleavage (LC/MS evidence available when systems are tested in Mab format), rather than tag interference with pairing. Hence tags were taken into account when presenting results in the relevant tables.

A summary of the results, in both LCCA and Fab pair format, across the different systems is reported in Tables 19 and 20. The results indicate design transferability across different tested Fab systems. These results do not necessarily reflect that some of the designs are better than others; nor do they reflect a more comprehensive transferability estimate. Successful LCCA designs (e.g., median ΔScalar (VAR-REF_WT)>0.2) in two systems or more, presented in Table 19, constitute app. 30% of tested LCCA designs in these different systems. This is indicative of transferability, dependent on the particular system and design. Thus, having a collection of unique designs allowed for the creation of bispecific pairs for a number of systems and highlights the utility of a library of design set that can be evaluated in the context of any antibody (or antibody pair) of interest. This example indicates that mutation/design sets can be used to achieve preferential pairing of heterodimers in co-expression sets comprising constant domain and/or variable domain modifications in mixed Ab or pure Ab heavy and light chain Fab format.

Example 12: Assessment of Preferential Pairing of Heterodimers in Co-Expression Sets Comprising Constant Domain and/or Variable Domain Modifications in D3H44 Heavy and Light Chain, in Full-Length Heavy Chain (Mab) Format

The heterodimer co-expression set designs were assessed to determine if they also allowed for preferential pairing in a format (Mab format) where the heavy chain is a full-length heavy chain and not a Fab portion.

Preparation of Constructs:

Constructs encoding the D3H44 IgG heavy and light chains comprising amino acid modifications according to the design of the co-expression set were prepared as follows. D3H44 Fab light chain constructs were prepared as described in Example 1. D3H44 heavy chain sequences were prepared as described in Example 1, except a full-length D3H44 heavy chain was created by appending the IgG1*01 DNA sequence, encoding the hinge-CH2-CH3 domains, onto the C-terminus of the D3H44 Fab heavy chain. Of note, the canonical C-terminal heavy chain lysine residue was removed in order to prevent LC-MS signal heterogeneity due to C-terminal lysine clipping (Lawrence W. Dick Jr. et al., Biotechnol. Bioeng. (2008) 100:1132-43).

Mab Assay Format

The ability of the D3H44 heavy and light chains to preferentially pair was assessed as follows: One full-length D3H44 heavy chain construct was co-expressed with two unique D3H44 light chain constructs, yielding three possible antibody species: H1-L1:H1-L1, H1-L2:H1-L2 and H1-L1:H1-L2 (see FIG. 10). The relative light chain pairing specificity in terms of the amount of preferred H1-L1:H1-L1 species vs. others was determined using LC-MS after proteinA (pA) purification. Where possible, chains were left untagged, provided the three possible Mab species resulting from co-transfection of the three chains differed by at least 50 Da from each other. When mass differences precluded this possibility, at least one of the light chains was constructed with an N-terminal HA or FLAG tag fusion in order to provide sufficient mass differentiation between species. As described in Example 2, heavy chain was kept in limiting quantities (i.e. HC<L1+L2).

Transfection Method for Mab Assay Format

Co-expression sets comprising one heavy chain and two light chain constructs prepared as described in Example 12 were transfected into CHO-3E7 cells as follows. CHO-3E7 cells, at a density of 1.7-2×10⁶cells/ml, were cultured at 37° C. in FreeStyle™ F17 medium (Invitrogen cat# A-1383501) supplemented with 4 mM glutamine and 0.1% Pluronic F-68 (Invitrogen cat#24040-032). A total volume 50 ml were transfected with a total of 50 ug DNA using PEI-pro (Polyplus cat#115-010) at a DNA:PEI ratio of 1:2.5. Twenty-four hours after the addition of the DNA-PEI mixture, the cells were transferred to 32° C. Supernatants were tested for expression on day 7 by non-reducing SDS-PAGE analysis followed by Coommassie blue staining to visualize the bands. HC: L1:L2 ratios used were 1:1:1.

Mass Spectrometry Method for Mab Assay Format

The degree of preferential D3H44 light chain pairing to D3H44 heavy chain in co-expression sets was assessed using mass spectrometry after protein A purification and deglycosylation The purified samples were de-glycosylated with PNGaseF as follows: 0.2 U PNGaseF/tg of antibody in 50 mM Tris-HCl pH 8.0, overnight incubation at 37° C., final protein concentration was 0.45 mg/mL. The protein samples were analyzed by LC-MS using an Agilent 1100 HPLC system coupled to an LTQ-Orbitrap XL hybrid mass spectrometer (ThermoFisher Scientific) via a high-flow electrospray interface. The samples (2.5 tg) were injected onto a 2.1×10 mm Poros R2 column (Applied Biosystems) and eluted using the following gradient conditions: 0-3 min: 20% solvent B; 3-6 min: 20-90% solvent B. Solvent A was 0.1% formic acid aq. and solvent B was 65% ACN, 25% THF, 9.9% ddH₂O, 0.1% formic acid. The flow rate was 1 mL/min. The flow was split post-column to direct 100 μL/min into the electrospray interface. The column and solvent were heated to 80° C. to improve protein peak shape. The LTQ-Orbitrap XL was calibrated using ThermoFisher Scientific's LTQ Positive Ion ESI calibration solution (caffeine, MRFA and Ultramark 1621), and tuned using a 10 mg/mL solutions of CsI. The cone voltage (source fragmentation setting) was 40 V, the FT resolution was 7,500 and the scan range was m/z 400-4,000.

The protein spectra were deconvoluted using the MaxEnt module of the MassLynx instrument control and data analysis algorithm (Waters). Briefly, the raw protein LC-MS data were first opened in QualBrower, the spectrum viewing module of Xcalibur (Thermo Scientific) and converted to be compatible with MassLynx using Databridge, a file conversion program provided by Waters. The converted protein spectra were viewed in the Spectrum module of MassLynx and deconvoluted using MaxEnt. The abundances of the different antibody species in each sample were determined directly from the resulting molecular weight profiles.

The results are shown in Table 21. The amino acid modifications shown in Table 21 are identified with reference to the amino acid sequence of D3H44 heavy chain and D3H44 light chain. See Tables A, A1, and A2.

A subset of designs that exhibited successful preferential pairing in D3H44 system, which are also representatives of a diverse design set, were chosen for assessment of format transferability, i.e., from LCCA with Fab structures to Mab competition assay (MCA) based on Mab structures.

As shown in Table 21, D3H44 WT used in a Mab competition assay exhibited deviation from the expected theoretical species distribution of 50% H1-L1_H1-L2 and 25% of each H1-L1_H1-L1 and H1-L2_H1-L2, where light chains were distinguished by the presence or absence of a tag (HA or FLAG). This deviation is likely a result of tag dependent expression levels rather than tag dependent pairing (e.g., based on experiments performed at different WT H1:L1:L2 ratios, as well as indirect observation on the basis of design behavior). Median values for all the measured species across experimental repeats and/or variants differing by tag identity were included as well.

An example of LC/MS spectra of two successful Mab assay variants (FIG. 11) that comprise a given design are found in FIG. 12. In the case of these variants a majority of the measured species corresponded to the desired H1-L1_H1-L1 species. Expression profiles, UPLC-SEC (shown only for H2-L2_H2-L2 species (in the Fig. denoted as H2-L2)), and DSC thermograms reported in FIG. 13 are typical for characterization of the variants carried out for certain hits. In this particular case these demonstrated well-expressed, homogeneous species with relatively minor impact on Fab stability.

The results in Table 21 demonstrate that transferability into the Mab format was achieved with notable success among the LCCA designs. 11 out of the 12 LCCA designs tested exhibited equal to or greater than the theoretical 25% correctly paired species. This data indicates that the LCCA format used as an initial screen of design library is suitable for assessing and/or predicting design success in the Mab format.

Example 13: Assessment of Preferential Pairing of Heterodimers in Co-Expression Sets Comprising Constant Domain and/or Variable Domain Modifications in a Bi-Specific Antibody Format

The heterodimer co-expression set designs were assessed to determine if they also allowed for preferential pairing in a bi-specific Mab antibody format. In this example, the Fc region of the full-length heavy chain of each heterodimer was asymmetrically modified to promote heterodimerization of the unique heavy chains compared to homodimerization.

Preparation of Constructs:

The heterodimer co-expression set designs were tested in the context of the following bi-specific antibodies: a) D3H44/pertuzumab, b) D3H44/trastuzumab, c) D3H44/ramucirumab, and d) trastuzumab/ramucirumab. The D3H44, pertuzumab, and trastuzumab heavy and light chains comprising amino acid modifications in the constant and/or variable domains were prepared as described in Example 12, except complementary Fc heterodimerization mutations were introduced into each two heavy chain of a co-expression design set. The ramucirumab heavy and light chains were prepared based on the base DNA sequence for the ramucirumab heavy chain and light chain. See Table A. The CH3 sequences of the heavy chains included the following amino acid modifications:

a) D3H44/pertuzumab: Chain A: T371V_T389L_K420L_T422W, Chain B: T371V_L372Y_F436A_Y438V

b) D3H44/trastuzumab: Chain A: T371V_T389L_K420L_T422W, Chain B: T371V_L372Y_F436A_Y438V

c) D3H44/ramucirumab: Chain A: T371V_T389L_K420L_T422W, Chain B: T371V_L372Y_F436A_Y438V

d) Trastuzumab/ramucirumab: Chain A: T371V_T389L_K420L_T422W, Chain B: T371V_L372Y_F436A_Y438V

Assay Format (SMCA)

The ability of the heterodimer co-expression set designs to preferentially pair to form a bi-specific antibody was assessed as described below. The assay is based on co-expressing the four chains (two from Ab 1 and two from Ab2) and detecting the presence of correctly formed bispecific antibody using mass spectrometry (LC-MS). The assay was carried out as follows. FIG. 14 provides a schematic depicting the four starting polypeptide chains and the potential products resulting from co-expression of these starting polypeptide chains in the absence of preferential pairing between heavy and light chains of the heterodimer pairs. Two full-length heavy chain constructs were co-expressed with two unique light chain constructs, yielding ten possible antibody species: H1-L1:H1-L1, H1-L2:H1-L2, H1-L1:H1-L2, H2-L1:H2-L1, H2-L2:H2-L2, H2-L1:H2-L2, H1-L1:H2-L1, H1-L2:H2-L2, H1-L2:H2-L1 and H1-L1:H2-L2. The latter species is the correctly paired heterodimer (see Fig. below). The relative pairing specificity in terms of amount of preferred species H1-L1:H2-L2 vs. others was determined using LC-MS after pA purification. When possible, chains were left untagged, provided all Mab and half-Ab species differed from each other by at least 50 Da. When mass differences precluded this possibility, one of the light chains was constructed with an N-terminal HA tag fusion in order to provide sufficient mass differentiation between species. We refer to this assay involving the expression and screening steps of a bispecific antibody as SMCA.

Transfection Method

Co-expression sets comprising two heavy chains and two light chain constructs prepared as described in Example 1 were transfected into CHO-3E7 cells as follows. CHO-3E7 cells, at a density of 1.7-2×10⁶cells/ml, were cultured at 37° C. in FreeStyle™ F17 medium (Invitrogen cat# A-1383501) supplemented with 4 mM glutamine and 0.1% Pluronic F-68 (Invitrogen cat#24040-032). A total volume of 20 ml were transfected with a total of 20 ug DNA using PEI-pro (Polyplus cat#115-010) at a DNA:PEI ratio of 1:2.5. Twenty-four hours after the addition of the DNA-PEI mixture, the cells were transferred to 32° C. Supernatants were tested for expression on day 7 by non-reducing SDS-PAGE analysis followed by Coommassie blue staining to visualize the bands. H1:H2: L1:L2 ratios used were initially kept neutral (15:15:35:35) to assess expression efficiency. A set of H1:H2:L1:L2 DNA ratios was then tested in CHO expressions to assess which condition(s) produced a mixture reflecting the theoretical distribution of the different species when all chains are wild-type. These ratios compensate for natural differences in expression levels and/or intrinsic pairing biases between heavy and light chains of the two different antibodies.

SPR Biosensor Assays

EDC: 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride; sNHS: N-hydroxysulfosuccinimide; SPR: surface plasmon resonance; EDTA: ethylenediaminetetraacetic acid; HEPES: 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid; TF: tissue factor.

Spr Supplies.

GLC sensorchips, the Biorad ProteOn amine coupling kit (EDC, sNHS and ethanolamine), and 10 mM sodium acetate buffers were purchased from Bio-Rad Laboratories (Canada) Ltd. (Mississauga, ON). Recombinant Her-2 protein was purchased from eBioscience (San Diego, Calif.). PBS running buffer with 0.05% Tween20 (PBST) was purchased from Teknoca Inc. (Hollister, Calif.). Goat polyclonal anti-human Fc antibody was purchased from Jackson Immuno Research Laboratories Inc. (West Grove, Pa.).

All surface plasmon resonance assays were carried out using a BioRad ProteOn XPR36 instrument (Bio-Rad Laboratories (Canada) Ltd. (Mississauga, ON)) with PBST running buffer at a temperature of 25° C. The anti-human Fc capture surface was generated using a GLC sensorchip activated by a 1:5 dilution of the standard BioRad sNHS/EDC solutions injected for 140 s at 100 μL/min in the analyte (horizontal) direction. Immediately after the activation, a 10 ug/mL solution of anti-human Fc antibody in 10 mM NaOAc pH 4.5 was injected in the ligand (vertical) direction at a flow rate of 25 μL/min until approximately 3000 resonance units (RUs) were immobilized. Remaining active groups were quenched by a 140 s injection of 1M ethanolamine at 100 μL/min in the analyte direction, and this also ensures mock-activated interspots are created for blank referencing. The screening of the antibody variants for binding to Her2 or TF antigen targets occurred in two steps: an indirect capture of the antibody variants onto the anti-human Fc antibody surface in the ligand direction followed by the simultaneous injection of 5 concentrations of purified antigen and one buffer blank for double referencing in the analyte direction. Firstly, one buffer injection for 30 s at 100 uL/min in the ligand direction was used to stabilize the baseline. For each antibody variant capture, unpurified variants in cell-culture media were diluted to 4% in PBST. One to five variants or controls were simultaneously injected in individual ligand channels for 240 s at flow 25 μL/min. This resulted in a capture of approximately 400 to 600 RUs onto the anti-human Fc surface. The first ligand channel was left empty to use as a blank control if required. This capture step was immediately followed by two buffer injections in the analyte direction to stabilize the baseline, and then 60 nM, 20 nM, 6.7 nM, 2.2 nM and 0.74 nM antigen (TF or Her2) along with a buffer blank was simultaneously injected at 50 μL/min for 120 s with a 300 s dissociation phase. The captured antibody surfaces were regenerated by an 18 s pulse of 0.85% phosphoric acid for 18 s at 100 L/min to prepare for the next injection cycle. Sensorgrams were aligned and double-referenced using the buffer blank injection and interspots, and the resulting sensorgrams were analyzed using ProteOn Manager software v3.1. The double-referenced sensorgrams were fit to the 1:1 binding model. Rmax values for each antigen were normalized to antibody capture levels for each variant and compared to 100% controls.

The stability of the bi-specific antibodies was tested as described in Example 5. The LCMS analysis of the bi-specific antibodies was carried out using the procedure described in Example 12.

A number of selected D3H44 LCCA designs were tested in the SMCA format. Furthermore, some designs were directly evaluated in the SMCA format only. A majority of designs were tested in the D3H44/Pertuzumab system. A subset of designs that were selected in this system (involving a variety of designs) were further tested in three additional bispecific Mab systems: D3H44/Trastuzumab, D3H44/Ramucirumab (RAMU), and Trastuzumab/Ramucirumab systems.

Inherent cross-system light/heavy chain preference observed for D3H44/Pertuzumab and D3H44/Trastuzumab in LCCA experiments (Example 11) was reproducible in the full antibody format as well. Furthermore, similar behavior, although with different cross-system pairing tendencies, was also observed for the other two systems (Table 22).

Desired bispecific species, H1-L1_H2-L2, cannot generally be distinguished experimentally on the basis of LC/MS from the mispaired type: H1-L2_H2-L1. As such, when bispecific content is reported in the tables, it cannot be completely excluded that it does not contain this type of mispaired species. However, the very low content observed for species such as H1-L2 H1-L2 and H2-L1 H2-L1 as well as H1-L2 and H2-L1 half antibodies is indicative that only minor if any contamination of the bispecific species occurred. All other species present in a particular sample measured by LC/MS are included in the table. In most cases MS peaks were accompanied by side peaks; however these were annotated only for the bispecific species. When normalizing MS peak intensities, these adduct species were not taken into account. The number represented for side peak(s) in the tables was obtained by intensity comparison with that for the main bispecific peak. Some preliminary analysis indicated the side peak as being correlated with the presence of light chain tags involving the formation of adducts or heterogeneity in the cleavage of leader peptides and it is likely representative of the main peak species.

Finally all paired species were summed up, in addition to mispaired species, to obtain the ratios reported in the paired: mispaired column. This was further used to calculate ΔScalar (VAR-REF_WT) following the mathematical approach described in Example 11 to demonstrate the strength of a particular design. Within a system, designs were ordered in descending values of the Scalar metric.

Due to the cross-system natural preference for pairing of light and heavy chains described in Example 12, DNA ratios of H1:L1:H2:L2 required alteration (e.g. over-expression of H1 (D3H44) over H2 (PERT)) to yield the highest content of desired bispecific species (H1-L1_H2-L2). Optimal ratios may also vary to some degree with a particular design as a result. DNA ratios mainly affect the actual ratio of bispecific species: paired half antibody species (usually of one type: H1-L1 or H2-L2). An example is included in Table 23, where DNA titration with this ratio varied but the overall ratio of paired: mispaired species was relatively constant.

In the case of the D3H44/Pertuzumab system, a limited set of DNA titrations was performed for the majority of designs. Data for ratios that resulted in the highest paired: mispaired species content are shown in Tables 22 and 24. For the other three systems, transfections were performed only at one ratio, as reported in the tables. If at the reported ratio the experiment was repeated, a mean value was included in the table. Information with respect to tag identity was not included as tag influence was not observed for the WT (Table 26). The WT reference provided was chosen as such to be representative of the most common DNA ratio among reported design data.

LC/MS analysis was performed on samples that were stored at pH4 as well as at pH7. Experiments on samples stored at pH7 were tested in the case of D3H44/Trastuzumab, D3H44/Ramucirumab (RAMU), and Trastuzumab/Ramucirumab systems. Hence data is presented in two tables (Table 22 (pH4) and 24 (pH7)). Satisfactory correlation was observed for paired: mispaired species ratios between the two experiments, indicating that pH did not likely play a substantial role in the LC/MS experiment, i.e., in characterization of the type of species present.

FIG. 15 presents the LC/MS analysis results of the bispecific construct targeting TF and Her2 based on the designs presented in FIG. 11 and FIG. 12 in the MCA format. Close to 92% of the preferred bispecific antibody with the correct pairing of obligate heavy and light chains was observed. FIG. 16 presents the expression profile of the protein product in the supernatant and following protein A purification in SDS PAGE as well as the SEC profile of the protein A purified compound. FIG. 17 presents the bispecific target binding features of the bispecific molecule, first to the two targets (TF and Her2 independently) and then in a sandwich (bridging) mode.

In some cases, the mutations at H/S 115 and H/S 156, featured in limited set of variants, were not part of the actual design, but rather were added for practical reasons to gain necessary mass difference for the purposes of a LC/MS experiment. These amino acids are located on the surface of the constant domain, sufficiently away from the actual design set of mutations and is not expected to impact the behavior of the antibody.

Thermal stability and antigen affinity were assessed for a number of designs in the D3H4/Pertuzumab system and are presented in Table 25. Tm values (indicated in italics) were annotated manually. Homodimeric Mab controls (WT) exhibited the following stability range of thermal melting (T_(m)) for product expressed in transfection repeats (at pH 7): PERT (72.03-77.72) and for D3H44 (77.97-78.88). The wider range observed for pertuzumab is likely due to its intrinsic properties. Affinity measurements were undertaken post pA purification only. Observed KD range for Homodimeric Mabs is: TF: 0.04-0.076 nM, HER2: 1.84-6.3 nM. In most cases one would expect two melting transitions due to the above indicated different ranges for the two Fabs. In cases where only one value for Tm is reported, it potentially arose due to one of two reasons: variation of pertuzumab stability or/and destabilization of the design on the D3H44 Fab that may coincide with that of pertuzumab Fab. Results indicate that selection of SMCA variants ranges from the ones that affect thermal stability minimally as well as antigen binding to the ones that do so to varying degrees.

Some designs were tested with both possible Fc mutation placements. These are identifiable by having the same unique identifier and denoted with *D3H44 (or in Table 25 with # preceding unique identifier set). As evident from Table 22, placement of Fc mutations does not appear to influence the paired: mispaired species outcome with a limited exception.

The SMCA results show that a substantial number of designs representing a diverse set can overcome the natural cross-species pairing tendency in the Mab format. Among these, close to a quarter of the designs tested fall into the category of high paired:mispaired ratio (>=80:20) (numbers are provided for the more exhaustively tested system, D3H44/Pertuzmab). Comparison of design effectiveness across different systems reveals varying degrees of transferability. Furthermore, approximately 60% of designs, listed in Table 27, transported from the D3H44 LCCA into the SMCA format resulted in a high degree of preferential pairing (>75%: 25%) in at least one of the four tested SMCA systems.

For designs tested in systems other than D3H44/Pertuzumab, design placement was inverted with respect to the binding domains as well, e.g. H1-L1 designs on D3H44 binding domain, H2-L2 designs on TRAS as well as H1-L1 designs on TRAS and H2-L2 designs on D3H44. The results demonstrate that a design's effectiveness in a majority of cases could be impacted by such a flipped placement.

The results discussed above indicate that a design's transferability can be impacted by a combination of antigen binding domains coupled with the driving potential of design constituents (e.g. H1L1L2 driving may be better than H2L2L1). The results indicate that a large number of base core designs work across a range of Mab pairs to form a bispecific Ab with greater than 75% selective pairing.

Example 14: Molecular Modeling and Computer Guided Engineering of Fab Interface

We employed a structure and computational molecular modeling guided approach to produce a library of heavy and light chain mutation designs that can be screened in the context of other antibodies or fragments thereof to identify mutations that exhibit the desired specificity in the antibodies of interest. The design strategy for engineering preferential HC-LC pairing included first identifying a representative Ab (i.e. D3H44) to work with. Key criteria of this Ab are shown in Table 28.

TABLE 28 Criteria Importance Human or humanized IgG1/κ Similarity Has commonly used V_(H) and V_(L) subgroups Framework close to germline V_(H):V_(L) interdomain packing angle close to observed average for Fabs Structure available for apo- and complexed Fab Design No major structural changes observed upon binding antigen Antigen binding can be readily assayed Assay

As indicated in Table 28, key criteria presented by this antibody was that it was a human/humanized Ab, with commonly used VH and VL subgroups and minimal framework region mutations. In addition, structural considerations were that the VH:VL interdomain angle should be close to the average observed for Abs. After selection of the Fab, an in silico analysis of the Fab interface was carried with the aim being to identify and understand the important residues. A two-pronged approach was taken. First, a global analysis of the sequence conservation across the Fab variable and constant interfaces was carried out via sequence and structure alignments of publicly available Abs. An alignment of constant and variable domain sequences from various antibody subgroups is shown in FIG. 6. In parallel, the crystal structure interface D3H44 was analyzed using a number of molecular modeling tools listed in FIG. 18 (e.g. ResidueContacts™). These analyses resulted in the identification of a list of hotspot positions for engineering preferential HC-LC pairing. The hotspot positions determined from this analysis are listed in Table 29.

TABLE 29 Hotspot amino acid positions at the interface of the heavy (H) and light (L) chain in a typical Fab derived from human VH and VL kappa chains. These positions and amino acids are also mostly conserved in the VL lambda chains. These are mainly framework residues except for a few located in the CDR3 loops. The amino acids in the parent D3H44 sequences with Kabat numbering are provided in Tables A1-A2. H (Kabat) L (Kabat) V37 Y36 Q39 Q38 L45 P44 W47 L89 F100 F98 W103 F116 L124 F118 A139 V133 F174 L135

Next, potential mutations and designs at the hotspot positions as well as positions neighboring the hotspots of interest in the 3D crystal structure were simulated and identified via in silico mutagenesis and packing/modeling with Zymepack™ and scored on the basis of a number of factors including steric and electrostatic complementarity. FIG. 11 presents a limited number of hotspot positions at the heavy and light chain interface in the variable domains and how mutations can be introduced at these interface positions to facilitate selective pairing of the obligate chains while disfavoring the formation of incorrect chain pairs. Steric complementarity was modeled and also computed on the basis of energy factors such as van der Waals packing, cavitation effects and close contact of hydrophobic groups. Similarly, electrostatic interaction energies were modeled and evaluated on the basis of coulomb interactions between charges, hydrogen bonds, and desolvation effects. Both the preferred heavy and light chain pair models such as H1:L1 (or H2:L2) and incorrect pair such as H1:L2 (and H2:L1) obtained by introducing the mutations of interest were simulated to compute the relative steric and electrostatic scores. This allowed us to decide if a particular mutation set lead to favorable energies i.e. greater steric or electrostatic complementarity for the preferred (obligate) heavy-light chain pairs relative to the incorrect (non-obligate) pairs. The computed steric and electrostatic energies are components of the free energy associated with the light and heavy chain pairing. Hence greater steric and electrostatic complementarity is indicative of a larger free energy change associated with the pairing of the obligate pair relative to the pairing of the non-obligate pair. The greater steric or electrostatic complementarity results in preferential (selective) pairing of the obligate heavy and light chains relative to the non-obligate pair and can be detected in terms of the percentage of the two products (obligate paired vis-à-vis the non-obligate paired heavy and light chain) upon co-expression. The greater steric or electrostatic complementarity in the obligate pair can also be typically observed in terms of better/greater thermal stability relative to the non-obligate pair. Candidate designs were shortlisted and ranked. Designs were initially tested in vitro using the LCCA system. Moderately performing designs displaying non-optimal biophysical characteristics such as poor HC-LC pairing specificity, low thermal stability, or reduced antigen binding affinity, were further improved via additional rounds of in silico design and in vitro screening. The best designs were then tested in a bispecific Mab format; in vitro screening primarily being via the SMCA format.

Example 15: Generation of Bispecific Antibody Given Mab1 and Mab2 Using a Library of Bispecific Antibody Mutation Design Sets

In one embodiment, presented here is a bispecific antibody mutation design set aimed at selectively forming bispecific antibodies given two canonical antibodies Mab 1 and Mab2 comprising of the antigen binding fragments Fab1 and Fab2 respectively. The design set consists of cognate mutations corresponding to Fab 1, Fab2 and Fc respectively. Mutations are introduced at the interface of light and heavy chain of Fab 1 to achieve selective pairing between the two obligate chains in the presence of competing light and heavy chain of Fab2. Selective pairing is achieved by introducing favorable complementary mutations in the two obligate light and heavy chains on the basis of steric, hydrophobic or electrostatic complementarity between certain hotspot framework residues at the interface while involving these mutated residues in unfavorable interface interaction for the non-obligate chain pairs. In each design set selective pairing mutations can also be introduced at the interface of light and heavy chain of Fab2 to achieve selective pairing between these two obligate chains in the presence of competing light and heavy chain of Fab 1. The mutations are aimed at reducing the mis-pairing of light chain from Fab 1 with heavy chain of Fab2 and vice-versa. Mutations are introduced at the Fc interface in order to achieve selective pairing of heavy chains to form asymmetric antibody molecules comprising two different heavy chains.

Engineering at certain interface residue positions of light and heavy chains of an antibody can often lead to detrimental effects such as loss in antigen binding affinity, stability, solubility, aggregation propensity etc of that antibody. A number of related properties can be affected such as kon and koff rates, melting temperature (Tm), stability to stress conditions like acid, base, oxidation, freeze/thaw, agitation, pressure etc. This is often impacted by the complementarity determining regions (CDR's) of the antibody of interest. Given that the CDR's of antibodies are not generally the same, the impact of the mutation design set may not be the same across all antibodies. In another embodiment, a number of different bispecific mutation design sets constituting a library of bispecific antibody mutation design sets are defined that involve mutations at different hotspot positions at the interface. A library of such bispecific antibody mutation design sets is shown in Table 30. Presented here is a method to create a bispecific antibody with noted purity relative to other contaminants containing incorrectly paired antibody-like structures, given any two available antibodies Mab 1 and Mab2. The light and heavy chains of Mab 1 and Mab2 are co-expressed after introducing the cognate mutations of each of the mutation design sets and the expressed antibody product is analytically screened to estimate the purity of the preferred bispecific antibody relative to other Mab like species expressed in the protein product. In some embodiments the analytical screening procedure may be based on an LC-MS technique. In some embodiments the analytical screening procedure may be based on charge based separation such as a capillary isoelectric focusing (cIEF) technique. An example of the screening technique is presented in Example 13 based on the SMCA procedure. In some embodiments the noted purity of the bispecific antibody is defined as being greater than 70% of all the obtained Mab like species in the expressed protein product. In some embodiments the noted purity of the bispecific antibody is defined as being greater than 90% of all the obtained Mab like species in the expressed protein product. The procedure for preparation and selection of bispecific Mab design set given Mab 1 and Mab2 is shown schematically in FIG. 19.

All publications, patents and patent applications mentioned in this specification are herein incorporated by reference into the specification to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated herein by reference.

While the invention has been particularly shown and described with reference to a preferred embodiment and various alternate embodiments, it will be understood by persons skilled in the relevant art that various changes in form and details can be made therein without departing from the spirit and scope of the invention.

All references, issued patents and patent applications cited within the body of the instant specification are hereby incorporated by reference in their entirety, for all purposes.

TABLE A SEQUENCES SEQ ID NO DESCRIPTION SEQUENCE  4 D3H44 light DIQMTQSPSSLSASVGDRVTITCRASRDIKSYLNWYQQKPGKAPKVLIYYATSLAE chain (Domain GVPSRFSGSGSGTDYTLTISSLQPEDFATYYCLQHGESPWTFGQGTKVEIKRTVAA boundaries: PSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDS VL; D1- KDSTYSLSSTLTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC K107, CL; R108-C214)  5 Pertuzumab DIQMTQSPSSLSASVGDRVTITCKASQDVSIGVAWYQQKPGKAPKLLIYSASYRYT light chain GVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQYYTYPYTFGQGTKVEIKRTVAA (Domain PSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDS boundaries: KDSTYSLSSTLTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC VL; D1- K107, CL; R108-C214)  6 Trastuzumab DIQMTQSPSSLSASVGDRVTITCRASQDVNTAVAWYQQKPGKAPKLLIYSASFLYS light chain GVPSRFSGSRSGTDFTLTISSLQPEDFATYYCQQHYTTPPTFGQGTKVEIKRTVAA (Domain PSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDS boundaries: KDSTYSLSSTLTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC VL; D1- K107, CL; R108-C214)  7 Ramucirumab DIQMTQSPSSVSASIGDRVTITCRASQGIDNWLGWYQQKPGKAPKLLIYDASNLDT light chain GVPSRFSGSGSGTYFTLTISSLQAEDFAVYFCQQAKAFPPTFGGGTKVDIKGTVAA (Domain PSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDS boundaries: KDSTYSLSSTLTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC VL; D1- K107, CL; G108-C214)  8 D3H44 heavy EVQLVESGGGLVQPGGSLRLSCAASGFNIKEYYMHWVRQAPGKGLEWVGLIDPEQG chain (Domain NTIYDPKFQDRATISADNSKNTAYLQMNSLRAEDTAVYYCARDTAAYFDYWGQGTL boundaries: VTVSSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVH VH; E1- TFPAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSCDKTH S113, CH1; TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD A114-K223, GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI Hinge; E226- SKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYK P243, CH2; TTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG A244-K360, CH3; G361- G477)  9 Pertuzumab EVQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPNSG heavy chain GSIYNQRFKGRFTLSVDRSKNTLYLQMNSLRAEDTAVYYCARNLGPSFYFDYWGQG (Domain TLVTVSSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSG boundaries: VHTFPAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSCDK VH; E1- THTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWY S113, CH1; VDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK A114-K223, TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENN Hinge; E226- YKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG P243, CH2; A244-K360, CH3; G361- G477) 10 Trastuzumab EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARTYPTNG heavy chain YTRYADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMDYWGQ (Domain GTLVTVSSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTS boundaries: GVHTFPAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSCD VH; E1- KTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNW S113, CH1; YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIE A114-K223, KTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPEN Hinge; E226- NYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP P243, CH2; G A244-K360, CH3; G361- G477) 11 Ramucirumab EVQLVQSGGGLVKPGGSLRLSCAASGFTFSSYSMNWVRQAPGKGLEWVSSISSSSS heavy chain YIYYADSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARVTDAFDIWGQGTMV (Domain TVSSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHT boundaries: FPAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSCDKTHT VH; E1- CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDG S113, CH1; VEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS A114-K223, KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKT Hinge; E226- TPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG P243, CH2; A244-K360, CH3; G361- G477) 12 Trastuzumab_HC GCCACCATGGCCGTGATGGCTCCTAGAACCCTGGTGCTGCTGCTGTCTGGAGCTCT GGCTCTGACTCAGACCTGGGCTGGAGAGGTGCAGCTGGTGGAAAGCGGAGGAGGAC TGGTGCAGCCAGGAGGATCTCTGCGACTGAGTTGCGCCGCTTCAGGATTCAACATC AAGGACACCTACATTCACTGGGTGCGACAGGCTCCAGGAAAAGGACTGGAGTGGGT GGCTCGAATCTATCCCACTAATGGATACACCCGGTATGCCGACTCCGTGAAGGGGA GGTTTACTATTAGCGCCGATACATCCAAAAACACTGCTTACCTGCAGATGAACAGC CTGCGAGCCGAAGATACCGCTGTGTACTATTGCAGTCGATGGGGAGGAGACGGATT CTACGCTATGGATTATTGGGGACAGGGGACCCTGGTGACAGTGAGCTCCGCCTCTA CCAAGGGCCCCAGTGTGTTTCCCCTGGCTCCTTCTAGTAAATCCACCTCTGGAGGG ACAGCCGCTCTGGGATGTCTGGTGAAGGACTATTTCCCCGAGCCTGTGACCGTGAG TTGGAACTCAGGCGCCCTGACAAGCGGAGTGCACACTTTTCCTGCTGTGCTGCAGT CAAGCGGGCTGTACTCCCTGTCCTCTGTGGTGACAGTGCCAAGTTCAAGCCTGGGC ACACAGACTTATATCTGCAACGTGAATCATAAGCCCTCAAATACAAAAGTGGACAA GAAAGTGGAGCCCAAGAGCTGTGATAAGACCCACACCTGCCCTCCCTGTCCAGCTC CAGAACTGCTGGGAGGACCTAGCGTGTTCCTGTTTCCCCCTAAGCCAAAAGACACT CTGATGATTTCCAGGACTCCCGAGGTGACCTGCGTGGTGGTGGACGTGTCTCACGA GGACCCCGAAGTGAAGTTCAACTGGTACGTGGATGGCGTGGAAGTGCATAATGCTA AGACAAAACCAAGAGAGGAACAGTACAACTCCACTTATCGCGTCGTGAGCGTGCTG ACCGTGCTGCACCAGGACTGGCTGAACGGGAAGGAGTATAAGTGCAAAGTCAGTAA TAAGGCCCTGCCTGCTCCAATCGAAAAAACCATCTCTAAGGCCAAAGGCCAGCCAA GGGAGCCCCAGGTGTACACACTGCCACCCAGCAGAGACGAACTGACCAAGAACCAG GTGTCCCTGACATGTCTGGTGAAAGGCTTCTATCCTAGTGATATTGCTGTGGAGTG GGAATCAAATGGACAGCCAGAGAACAATTACAAGACCACACCTCCAGTGCTGGACA GCGATGGCAGCTTCTTCCTGTATTCCAAGCTGACAGTGGATAAATCTCGATGGCAG CAGGGGAACGTGTTTAGTTGTTCAGTGATGCATGAAGCCCTGCACAATCATTACAC TCAGAAGAGCCTGTCCCTGTCTCCCGGCTGA 13 Trastuzumab_LC GCCACTATGGCTGTGATGGCCCCTAGGACCCTGGTGCTGCTGCTGTCCGGAGCTCT GGCTCTGACTCAGACCTGGGCTGGAGACATCCAGATGACCCAGTCTCCATCCTCCC TGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGCCGGGCAAGTCAGGACGTT AACACCGCTGTAGCTTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAGCTCCTGAT CTATTCTGCATCCTTTTTGTACAGTGGGGTCCCATCAAGGTTCAGTGGCAGTCGAT CTGGGACAGATTTCACTCTCACCATCAGCAGTCTGCAACCTGAAGATTTTGCAACT TACTACTGTCAACAGCATTACACTACCCCACCCACTTTCGGCCAAGGGACCAAAGT GGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATG AGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCC AGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGCCCTCCAATCGGGTAACTCCCA AGAGAGTGTCACAGAGCAGGACAGCAAGGACAGCACCTACAGCCTCAGCAGCACCC TGACGCTGAGCAAAGCAGACTACGAGAAACACAAAGTCTACGCCTGCGAAGTCACC CATCAGGGCCTGAGCTCGCCCGTCACAAAGAGCTTCAACAGGGGAGAGTGTTGA 14 Ramucirumab_HC GCCACCATGGCCGTGATGGCTCCTAGAACACTGGTCCTGCTGCTGTCAGGGGCACT GGCACTGACTCAGACTTGGGCTGGGGAGGTCCAGCTGGTCCAGTCCGGAGGAGGAC TGGTGAAGCCTGGAGGGAGTCTGCGACTGTCATGCGCCGCTAGCGGGTTCACCTTT AGCTCCTACAGCATGAACTGGGTGCGACAGGCACCAGGCAAAGGACTGGAATGGGT GTCTAGTATCTCAAGCTCCTCTAGTTACATCTACTATGCAGACAGCGTGAAGGGCC GGTTCACCATCAGCAGAGATAACGCCAAAAATTCCCTGTATCTGCAGATGAACAGC CTGCGAGCCGAGGACACCGCTGTCTACTATTGCGCACGGGTGACAGACGCCTTCGA TATTTGGGGACAGGGCACCATGGTCACAGTGTCAAGCGCCTCCACCAAGGGACCAA GCGTGTTCCCACTGGCTCCATCCTCTAAAAGCACTTCCGGAGGAACCGCAGCCCTG GGATGTCTGGTGAAGGATTACTTCCCAGAGCCCGTCACAGTGTCATGGAACAGCGG GGCTCTGACCTCTGGAGTCCACACATTTCCAGCAGTGCTGCAGAGTTCAGGACTGT ACAGCCTGAGCTCCGTGGTCACAGTGCCCTCTAGTTCACTGGGCACTCAGACCTAT ATCTGCAACGTGAATCACAAGCCAAGCAATACTAAAGTCGACAAGAAAGTGGAACC CAAGTCCTGTGATAAAACACATACTTGCCCACCTTGTCCTGCACCAGAGCTGCTGG GAGGACCATCCGTGTTCCTGTTTCCACCCAAGCCTAAAGACACTCTGATGATTTCT AGGACACCCGAAGTCACTTGCGTGGTCGTGGACGTGAGCCACGAGGACCCCGAAGT CAAGTTTAACTGGTACGTGGATGGCGTCGAGGTGCATAATGCTAAGACAAAACCTA GGGAGGAACAGTACAACAGTACATATAGAGTCGTGTCAGTCCTGACTGTGCTGCAT CAGGACTGGCTGAACGGAAAGGAATATAAGTGCAAAGTGAGCAATAAGGCTCTGCC CGCACCTATCGAGAAAACTATTTCCAAGGCTAAAGGCCAGCCTAGAGAACCACAGG TGTACACCCTGCCTCCATCTAGGGACGAGCTGACTAAGAACCAGGTCAGTCTGACC TGTCTGGTGAAAGGCTTCTATCCTAGCGATATCGCAGTGGAGTGGGAATCCAATGG GCAGCCAGAGAACAATTACAAGACCACACCCCCTGTGCTGGACTCCGATGGGTCTT TCTTTCTGTATAGTAAGCTGACCGTCGATAAATCACGGTGGCAGCAGGGAAACGTG TTCAGCTGTAGTGTCATGCACGAAGCACTGCACAATCATTACACCCAGAAGAGCCT GTCACTGTCACCCGGATGA 15 Ramucirumab_LC GCCACCATGGCTGTGATGGCACCTAGAACACTGGTCCTGCTGCTGTCCGGGGCACT GGCACTGACTCAGACTTGGGCTGGCGATATTCAGATGACCCAGAGTCCAAGCTCCG TGTCCGCCTCTATCGGCGACCGAGTCACCATTACATGCAGAGCTAGCCAGGGCATC GATAACTGGCTGGGGTGGTACCAGCAGAAGCCTGGAAAAGCCCCAAAGCTGCTGAT CTACGACGCTTCCAATCTGGATACAGGCGTGCCCTCTAGGTTCAGTGGCTCAGGGA GCGGAACTTACTTTACTCTGACCATCTCTAGTCTGCAGGCTGAGGACTTCGCAGTG TATTTTTGCCAGCAGGCAAAAGCCTTCCCCCCTACCTTTGGCGGGGGAACAAAAGT CGACATCAAGGGGACCGTGGCCGCTCCCTCAGTCTTCATTTTTCCACCCAGCGATG AGCAGCTGAAGTCTGGAACAGCCAGTGTGGTCTGTCTGCTGAACAATTTCTACCCT CGGGAAGCAAAAGTGCAGTGGAAGGTCGACAACGCCCTGCAGTCCGGCAACAGCCA GGAGAGTGTGACTGAACAGGACTCAAAAGATAGCACCTATTCCCTGTCAAGCACAC TGACTCTGTCCAAGGCTGATTACGAAAAGCACAAAGTGTATGCATGTGAGGTCACC CATCAGGGGCTGTCAAGTCCAGTCACAAAAAGTTTCAACCGAGGAGAGTGCTGA 16 D3H44_HC GCCACAATGGCCGTGATGGCTCCTAGAACACTGGTCCTGCTGCTGTCCGGGGCTCT GGCTCTGACTCAGACTTGGGCTGGGGAGGTGCAGCTGGTCGAATCTGGAGGAGGAC TGGTGCAGCCAGGAGGGTCACTGAGACTGAGCTGCGCCGCTTCCGGCTTCAACATC AAGGAGTACTATATGCACTGGGTGAGGCAGGCACCTGGCAAAGGACTGGAGTGGGT GGGACTGATCGACCCAGAACAGGGGAACACCATCTACGACCCTAAGTTTCAGGATA GGGCAACCATTTCTGCCGACAACAGTAAAAATACAGCTTATCTGCAGATGAACAGC CTGAGGGCTGAAGATACTGCAGTGTACTATTGCGCACGCGACACCGCAGCCTACTT CGATTATTGGGGACAGGGCACCCTGGTCACAGTGAGCTCCGCATCAACTAAGGGAC CCAGCGTGTTTCCACTGGCCCCCTCTAGTAAATCCACTTCTGGAGGCACCGCTGCA CTGGGCTGTCTGGTGAAGGATTACTTCCCAGAGCCCGTCACAGTGAGCTGGAACTC CGGGGCCCTGACCAGCGGAGTCCATACATTTCCTGCTGTGCTGCAGTCAAGCGGGC TGTACTCCCTGTCCTCTGTGGTCACCGTGCCAAGTTCAAGCCTGGGAACTCAGACC TATATCTGCAACGTGAATCACAAGCCTTCAAATACAAAAGTCGACAAGAAAGTGGA ACCAAAGAGCTGTGATAAAACACATACTTGCCCACCTTGTCCTGCACCAGAGCTGC TGGGAGGACCAAGCGTGTTCCTGTTTCCACCCAAGCCCAAAGACACCCTGATGATT TCCCGCACACCAGAAGTCACTTGCGTGGTCGTGGACGTGTCTCACGAGGACCCCGA AGTCAAGTTCAACTGGTACGTGGATGGCGTCGAGGTGCATAATGCCAAGACAAAAC CCCGGGAGGAACAGTACAACTCCACATATAGAGTCGTGTCTGTCCTGACTGTGCTG CACCAGGACTGGCTGAACGGGAAGGAGTATAAGTGCAAAGTGAGTAATAAGGCCCT GCCCGCTCCTATCGAGAAAACAATTAGCAAGGCCAAAGGCCAGCCTCGAGAACCAC AGGTGTACACTCTGCCTCCATCTCGGGACGAGCTGACTAAGAACCAGGTCAGTCTG ACCTGTCTGGTGAAAGGATTCTATCCCAGCGATATCGCTGTGGAGTGGGAATCCAA TGGCCAGCCTGAGAACAATTACAAGACCACACCCCCTGTGCTGGACTCTGATGGCA GTTTCTTTCTGTATAGTAAGCTGACCGTCGATAAATCACGATGGCAGCAGGGGAAC GTGTTCAGCTGTTCAGTGATGCACGAAGCCCTGCACAACCATTACACCCAGAAGAG CCTGAGCCTGTCTCCCGGCTGA 17 D3H44_LC GCCACAATGGCTGTGATGGCACCCCGAACCCTGGTCCTGCTGCTGAGTGGAGCACT GGCACTGACCCAGACATGGGCAGGCGACATCCAGATGACACAGTCCCCTAGCTCCC TGAGTGCCTCAGTGGGGGACAGAGTCACTATCACCTGCCGGGCTTCCAGAGATATT AAGTCTTACCTGAACTGGTATCAGCAGAAGCCAGGCAAAGCACCCAAGGTGCTGAT CTACTATGCCACCAGTCTGGCTGAAGGAGTGCCTTCACGGTTCAGCGGCTCCGGGT CTGGAACTGACTACACACTGACTATTTCTAGTCTGCAGCCTGAGGATTTCGCTACC TACTATTGCCTGCAGCACGGCGAATCCCCATGGACTTTTGGCCAGGGGACCAAAGT GGAGATCAAGAGGACAGTGGCCGCTCCATCCGTCTTCATTTTTCCCCCTTCTGACG AACAGCTGAAATCAGGAACTGCCAGCGTGGTCTGTCTGCTGAACAATTTCTACCCC CGCGAGGCAAAAGTGCAGTGGAAGGTCGATAACGCCCTGCAGAGTGGCAATTCACA GGAGAGCGTGACAGAACAGGACTCCAAAGATTCTACTTATAGTCTGTCAAGCACCC TGACACTGTCTAAGGCTGATTACGAGAAGCACAAAGTGTATGCATGCGAAGTCACC CATCAGGGGCTGTCCTCTCCCGTGACAAAGAGCTTTAATCGGGGAGAGTGTTGA 18 Pertuzumab_HC GCCACAATGGCTGTGATGGCTCCAAGAACCCTGGTCCTGCTGCTGTCCGGGGCTCT GGCTCTGACTCAGACCTGGGCCGGGGAAGTGCAGCTGGTCGAATCTGGAGGAGGAC TGGTGCAGCCAGGAGGGTCCCTGCGCCTGTCTTGCGCCGCTAGTGGCTTCACTTTT ACCGACTACACCATGGATTGGGTGCGACAGGCACCTGGAAAGGGCCTGGAGTGGGT CGCCGATGTGAACCCAAATAGCGGAGGCTCCATCTACAACCAGCGGTTCAAGGGCC GGTTCACCCTGTCAGTGGACCGGAGCAAAAACACCCTGTATCTGCAGATGAATAGC CTGCGAGCCGAAGATACTGCTGTGTACTATTGCGCCCGGAATCTGGGGCCCTCCTT CTACTTTGACTATTGGGGGCAGGGAACTCTGGTCACCGTGAGCTCCGCCTCCACCA AGGGACCTTCTGTGTTCCCACTGGCTCCCTCTAGTAAATCCACATCTGGGGGAACT GCAGCCCTGGGCTGTCTGGTGAAGGACTACTTCCCAGAGCCCGTCACAGTGTCTTG GAACAGTGGCGCTCTGACTTCTGGGGTCCACACCTTTCCTGCAGTGCTGCAGTCAA GCGGGCTGTACAGCCTGTCCTCTGTGGTCACCGTGCCAAGTTCAAGCCTGGGAACA CAGACTTATATCTGCAACGTGAATCACAAGCCATCCAATACAAAAGTCGACAAGAA AGTGGAACCCAAGTCTTGTGATAAAACCCATACATGCCCCCCTTGTCCTGCACCAG AGCTGCTGGGAGGACCAAGCGTGTTCCTGTTTCCACCCAAGCCTAAAGATACACTG ATGATTAGTAGGACCCCAGAAGTCACATGCGTGGTCGTGGACGTGAGCCACGAGGA CCCCGAAGTCAAGTTTAACTGGTACGTGGACGGCGTCGAGGTGCATAATGCCAAGA CTAAACCCAGGGAGGAACAGTACAACAGTACCTATCGCGTCGTGTCAGTCCTGACA GTGCTGCATCAGGATTGGCTGAACGGGAAAGAGTATAAGTGCAAAGTGAGCAATAA GGCTCTGCCCGCACCTATCGAGAAAACAATTTCCAAGGCAAAAGGACAGCCTAGAG AACCACAGGTGTACACTCTGCCTCCATCAAGGGATGAGCTGACAAAGAACCAGGTC AGCCTGACTTGTCTGGTGAAAGGATTCTATCCCTCTGACATTGCTGTGGAGTGGGA AAGTAATGGCCAGCCTGAGAACAATTACAAGACCACACCCCCTGTGCTGGACTCAG ATGGCAGCTTCTTTCTGTATAGCAAGCTGACCGTCGACAAATCCCGGTGGCAGCAG GGGAATGTGTTTAGTTGTTCAGTCATGCACGAGGCACTGCACAACCATTACACCCA GAAGTCACTGTCACTGTCACCAGGGTGA 19 Pertuzumab_LC GCCACAATGGCTGTGATGGCACCTAGAACACTGGTCCTGCTGCTGAGCGGGGCACT GGCACTGACACAGACTTGGGCCGGGGATATTCAGATGACCCAGTCCCCAAGCTCCC TGAGTGCCTCAGTGGGCGACCGAGTCACCATCACATGCAAGGCTTCCCAGGATGTG TCTATTGGAGTCGCATGGTACCAGCAGAAGCCAGGCAAAGCACCCAAGCTGCTGAT CTATAGCGCCTCCTACCGGTATACCGGCGTGCCCTCTAGATTCTCTGGCAGTGGGT CAGGAACAGACTTTACTCTGACCATCTCTAGTCTGCAGCCTGAGGATTTCGCTACC TACTATTGCCAGCAGTACTATATCTACCCATATACCTTTGGCCAGGGGACAAAAGT GGAGATCAAGAGGACTGTGGCCGCTCCCTCCGTCTTCATTTTTCCCCCTTCTGACG AACAGCTGAAAAGTGGCACAGCCAGCGTGGTCTGTCTGCTGAACAATTTCTACCCT CGCGAAGCCAAAGTGCAGTGGAAGGTCGATAACGCTCTGCAGAGCGGCAACAGCCA GGAGTCTGTGACTGAACAGGACAGTAAAGATTCAACCTATAGCCTGTCAAGCACAC TGACTCTGAGCAAGGCAGACTACGAGAAGCACAAAGTGTATGCCTGCGAAGTCACA CATCAGGGGCTGTCCTCTCCTGTGACTAAGAGCTTTAACAGAGGAGAGTGTTGA Note: The nucleotide sequences start with the signal peptide sequence and end with the stop codon TGA. These can be removed if desired. Tables A1-A2: Heavy chain and light chain amino acid sequences of D3H44, Trastuzumab, Pertuzumab and Ramucirumab

TABLES A1-A2 Heavy chain and light chain amino acid sequences of D3H44, Trastuzumab, Pertuzumab and Ramucirumab Table A1 Heavy chain origin PERTU- TRASTU- D3H44 ZUMAB ZUMAB RAMUCIRUMAB KABAT (SEQ ID (SEQ ID (SEQ ID (SEQ ID numbering NO: 8) NO: 9) NO: 10) NO: 11)  1 E E E E  2 V V V V  3 Q Q Q Q  4 L L L L  5 V V V V  6 E E E Q  7 S S S S  8 G G G G  9 G G G G  10 G G G G  11 L L L L  12 V V V V  13 Q Q Q K  14 P P P P  15 G G G G  16 G G G G  17 S S S S  18 L L L L  19 R R R R  20 L L L L  21 S S S S  22 C C C C  23 A A A A  24 A A A A  25 S S S S  26 G G G G  27 F F F F  28 N T N T  29 I F I F  32 K T K S  33 E D D S  34 Y Y T Y  35 Y T Y S  35A M M I M  35B H D H N  36 W W W W  37 V V V V  38 R R R R  39 Q Q Q Q  40 A A A A  41 P P P P  42 G G G G  43 K K K K  44 G G G G  45 L L L L  46 E E E E  47 W W W W  48 V V V V  49 G A A S  50 L D R S  51 I V I I  52 D N Y S  52A P P P S  52B E N T S  54 Q S N S  55 G G G S  56 N G Y Y  57 T S T I  58 I I R Y  59 Y Y Y Y  60 D N A A  61 P Q D D  62 K R S S  63 F F V V  64 Q K K K  65 D G G G  66 R R R R  67 A F F F  68 T T T T  69 I L I I  70 S S S S  71 A V A R  72 D D D D  73 N R T N  74 S S S A  75 K K K K  76 N N N N  77 T T T S  78 A L A L  79 Y Y Y Y  80 L L L L  81 Q Q Q Q  82 M M M M  82A N N N N  82B S S S S  82C L L L L  83 R R R R  84 A A A A  85 E E E E  86 D D D D  87 T T T T  88 A A A A  89 V V V V  90 Y Y Y Y  91 Y Y Y Y  92 C C C C  93 A A S A  94 R R R R  95 D N W V  96 T L G T  97 A G G D  98 P D G S F  98 A F Y  99 Y Y A A 100 F F M F 101 D D D D 102 Y Y Y I 103 W W W W 104 G G G G 105 Q Q Q Q 106 G G G G 107 T T T T 108 L L L M 109 V V V V 110 T T T T 111 V V V V 112 S S S S 113 S S S S 114 A A A A 115 S S S S 116 T T T T 117 K K K K 118 G G G G 119 P P P P 120 S S S S 121 V V V V 122 F F F F 123 P P P P 124 L L L L 125 A A A A 126 P P P P 127 S S S S 128 S S S S 129 K K K K 130 S S S S 133 T T T T 134 S S S S 135 G G G G 136 G G G G 137 T T T T 138 A A A A 139 A A A A 140 L L L L 141 G G G G 142 C C C C 143 L L L L 144 V V V V 145 K K K K 146 D D D D 147 Y Y Y Y 148 F F F F 149 P P P P 150 E E E E 151 P P P P 152 V V V V 153 T T T T 154 V V V V 156 S S S S 157 W W W W 162 N N N N 163 S S S S 164 G G G G 165 A A A A 166 L L L L 167 T T T T 168 S S S S 169 G G G G 171 V V V V 172 H H H H 173 T T T T 174 F F F F 175 P P P P 176 A A A A 177 V V V V 178 L L L L 179 Q Q Q Q 180 S S S S 182 S S S S 183 G G G G 184 L L L L 185 Y Y Y Y 186 S S S S 187 L L L L 188 S S S S 189 S S S S 190 V V V V 191 V V V V 192 T T T T 193 V V V V 194 P P P P 195 S S S S 196 S S S S 197 S S S S 198 L L L L 199 G G G G 200 T T T T 203 Q Q Q Q 205 T T T T 206 Y Y Y Y 207 I I I I 208 C C C C 209 N N N N 210 V V V V 211 N N N N 212 H H H H 213 K K K K 214 P P P P 215 S S S S 216 N N N N 217 T T T T 218 K K K K 219 V V V V 220 D D D D 221 K K K K 222 K K K K 223 V V V V 226 E E E E 227 P P P P 228 K K K K 232 S S S S 233 C C C C 234 D D D D 235 K K K K 236 T T T T 237 H H H H 238 T T I T 239 C C C C 240 P P P P 241 P p P P 242 C C C C 243 P P P P 244 A A A A 245 P P P P 246 E E E E 247 L L L L 248 L L L L 249 G G G G 250 G G G G 251 P P P P 252 S S S S 253 V V V V 254 F F F F 255 L L L L 256 F F F F 257 P P P P 258 P P P P 259 K K K K 260 P P P P 261 K K K K 262 D D D D 263 T T T T 264 L L L L 265 M M M M 266 I I I I 267 S S S S 268 R R R R 269 T T T T 270 P P P P 271 E E E E 272 V V V V 273 T T T T 274 C C C C 275 V V V V 276 V V V V 277 V V V V 278 D D D D 279 V V V V 280 S S S S 281 H H H H 282 E E E E 283 D D D D 284 P P P P 285 E E E E 286 V V V V 287 K K K K 288 F F F F 289 N N N N 290 W W W W 291 Y Y Y Y 292 V V V V 295 D D D D 296 G G G G 299 V V V V 300 E E E E 301 V V V V 302 H H H H 303 N N N N 304 A A A A 305 K K K K 306 T T T T 307 K K K K 308 P P P P 309 R R R R 310 E E E E 311 E E E E 312 Q Q Q Q 313 Y Y Y Y 314 N N N N 317 S S S S 318 T T T T 319 Y Y Y Y 320 R R R R 321 V V V V 322 V V V V 323 S S S S 324 V V V V 325 L L L L 326 T T T T 327 V V V V 328 L L L L 329 H H H H 330 Q Q Q Q 331 D D D D 332 W W W W 333 L L L L 334 N N N N 335 G G G G 336 K K K K 337 E E E E 338 Y Y Y Y 339 K K K K 340 C C C C 341 K K K K 342 V V V V 343 S S S S 344 N N N N 345 K K K K 346 A A A A 347 L L L L 348 P P P P 349 A A A A 350 P P P P 351 I I I I 352 E E E E 353 K K K K 354 T T T T 355 I I I I 357 S S S S 358 K K K K 359 A A A A 360 K K K K 361 G G G G 363 Q Q Q Q 364 P P P P 365 R R R R 366 E E E E 367 P P P P 368 Q Q Q Q 369 V V V V 370 Y Y Y Y 371 T T T T 372 L L L L 373 P P P P 374 P P P P 375 S S S S 376 R R R R 377 D D D D 378 E E E E 381 L L L L 382 T T T T 383 K K K K 384 N N N N 385 Q Q Q Q 386 V V V V 387 S S S S 388 L L L L 389 T T T T 390 C C C C 391 L L L L 392 V V V V 393 K K K K 394 G G G G 395 F F F F 396 Y Y Y Y 397 P P P P 398 S S S S 399 D D D D 400 I I I I 401 A A A A 402 V V V V 405 E E E E 406 W W W W 407 E E E E 408 S S S S 410 N N N N 411 G G G G 414 Q Q Q Q 415 P P P P 416 E E E E 417 N N N N 418 N N N N 419 Y Y Y Y 420 K K K K 421 T T T T 422 T T T T 423 P P P P 424 P P P P 425 V V V V 426 L L L L 427 D D D D 428 S S S S 430 D D D D 433 G G G G 434 S S S S 435 F F F F 436 F F F F 437 L L L L 438 Y Y Y Y 439 S S S S 440 K K K K 441 L L L L 442 T T T T 443 V V V V 444 D D D D 445 K K K K 446 S S S S 447 R R R R 448 W W W W 449 Q Q Q Q 450 Q Q Q Q 451 G G G G 452 N N N N 453 V V V V 454 F F F F 455 S S S S 456 C C C C 457 S S S S 458 V V V V 459 M M M M 460 H H H H 461 E E E E 462 A A A A 463 L L L L 464 H H H H 465 N N N N 466 H H H H 467 Y Y Y Y 468 T T T T 469 Q Q Q Q 470 K K K K 471 S S S S 472 L L L L 473 S S S S 474 L L L L 475 S S S S 476 P P P P 477 G G G G Variable regions: HFR1; 1-30, CDR-H1; 31-35, HFR2; 36-49, CDR-H2; 50-65, HFR3; 66-94, CDR-H3; 95-102, HFR4; 103-113 (Reference: Molecular Immunology. Volume 45, Issue 14 August 2008, Pages 3832-3839).

Table A2 Light chain origin PERTU- TRASTU- D3H44 ZUMAB ZUMAB RAMUCIRUMAB KABAT (SEQ ID (SEQ ID (SEQ ID (SEQ ID numbering NO: 4) NO: 5) NO: 6) NO: 7) 1 D D D D 2 I I I I 3 Q Q Q Q 4 M M M M 5 T T T T 6 Q Q Q Q 7 S S S S 8 P P P P 9 S S S S 10 S S S S 11 L L L V 12 S S S S 13 A A A A 14 S S S S 15 V V V I 16 G G G G 17 D D D D 18 R R R R 19 V V V V 20 T T T T 21 I I I I 22 T T T T 23 C C C C 24 R K R R 25 A A A A 26 S S S S 27 R Q Q Q 28 D D D G 29 I V V I 30 K S N D 31 S I T N 32 Y G A W 33 L V V L 34 N A A G 35 W W W W 36 Y Y Y Y 37 Q Q Q Q 38 Q Q Q Q 39 K K K K 40 P P P P 41 G G G G 42 K K K K 43 A A A A 44 P P P P 45 K K K K 46 V L L L 47 L L L L 48 I I I I 49 Y Y Y Y 50 Y S S D 51 A A A A 52 T S S S 53 S Y F N 54 L R L L 55 A Y Y D 56 E T S T 57 G G G G 58 V V V V 59 P P P P 60 S S S S 61 R R R R 62 F F F F 63 S S S S 64 G G G G 65 S S S S 66 G G R G 67 S S S S 68 G G G G 69 T T T T 70 D D D Y 71 Y F F F 72 T T T T 73 L L L L 74 T T T T 75 I I I I 76 S S S S 77 S S S S 78 L L L L 79 Q Q Q Q 80 P P P A 81 E E E E 82 D D D D 83 F F F F 84 A A A A 85 T T T V 86 Y Y Y Y 87 Y Y Y F 88 C C C C 89 L Q Q Q 90 Q Q Q Q 91 H Y H A 92 G Y Y K 93 E I T A 94 S Y T F 95 P P P P 96 W Y P P 97 T T T T 98 F F F F 99 G G G G 100 Q Q Q G 101 G G G G 102 T T T T 103 K K K K 104 V V V V 105 E E E D 106 I I I I 107 K K K K 108 R R R G 109 T T T T 110 V V V V 111 A A A A 112 A A A A 113 P P P P 114 S S S S 115 V V V V 116 F F F F 117 I I I I 118 F F F F 119 P P P P 120 P P P P 121 S S S S 122 D D D D 123 E E E E 124 Q Q Q Q 125 L L L L 126 K K K K 127 S S S S 128 G G G G 129 T T T T 130 A A A A 131 S S S S 132 V V V V 133 V V V V 134 C C C C 135 L L L L 136 L L L L 137 N N N N 138 N N N N 139 F F F F 140 Y Y Y Y 141 P P P P 142 R R R R 143 E E E E 144 A A A A 145 K K K K 146 V V V V 147 Q Q Q Q 148 W W W W 149 K K K K 150 V V V V 151 D D D D 152 N N N N 153 A A A A 154 L L L L 155 Q Q Q Q 156 S S S S 157 G G G G 158 N N N N 159 S S S S 160 Q Q Q Q 161 E E E E 162 S S S S 163 V V V V 164 T T T T 165 E E E E 166 Q Q Q Q 167 D D D D 168 S S S S 169 K K K K 170 D D D D 171 S S S S 172 T T T T 173 Y Y Y Y 174 S S S S 175 L L L L 176 S S S S 177 S S S S 178 T T T T 179 L L L L 180 T T T T 181 L L L L 182 S S S S 183 K K K K 184 A A A A 185 D D D D 186 Y Y Y Y 187 E E E E 188 K K K K 189 H H H H 190 K K K K 191 V V V V 192 Y Y Y Y 193 A A A A 194 C C C C 195 E E E E 196 V V V V 197 T T T T 198 H H H H 199 Q Q Q Q 200 G G G G 201 L L L L 202 S S S S 203 S S S S 204 P P P P 205 V V V V 206 T T T T 207 K K K K 208 S S S S 209 F F F F 210 N N N N 211 R R R R 212 G G G G 213 E E E E 214 C C C C Variable regions: LFR1; 1-23, CDR-L1; 24-34, LFR2; 35-49, CDR-L2; 50-56, LFR3; 57-88, CDR-L3; 89-97, LFR4; 98-110 (Reference: Molecular Immunology. Volume 45, Issue 14 August 2008, Pages 3832-3839).

Key for Tables 14-27 and 30-31 Table 14. LCCA data for D3H44 system, in Fab pair format Table 15. Additional LCCA data for D3H44 system, in Fab pair format Table 16. LCCA data for selected pure and mixed systems Table 17. LCCA data for selected pure and mixed systems, in Fab pair format Table 18. WT LCCA_data for selected pure and mixed systems Table 19. Table 17 data summary Table 20. Table 18 data summary Table 21. Mab assay data for D3H44 system Table 22. SMCA data for selected systems, pH 4 Table 23. DNA titration example for SMCA experiments Table 24. SMCA data for selected systems, pH 7 Table 25. Thermal Stability and Antigen binding data for selected SMCA designs Table 26. SMCA data for selected WT (vis-a-vis light chain tags) Table 27. Successful designs in both LCCA (D3H44) and at least one tested SMCA system Table 30. Design library Table 31. Core designs 

1.-61. (canceled)
 62. An isolated antigen binding polypeptide construct comprising at least a first heterodimer and a second heterodimer, the first heterodimer comprising a first immunoglobulin heavy chain polypeptide sequence (H1) and a first immunoglobulin light chain polypeptide sequence (L1), and binding to a first epitope; and the second heterodimer comprising a second immunoglobulin heavy chain polypeptide sequence (H2) and a second immunoglobulin light chain polypeptide sequence (L2), and binding to a second epitope, wherein at least one of the H1 or L1 sequences of the first heterodimer is distinct from the corresponding H2 or L2 sequence of the second heterodimer, H1 and H2 each comprise a heavy chain variable domain (VH domain) and a heavy chain constant domain (CH1 domain), and L1 and L2 each comprise a light chain variable domain (VL domain) and a light chain constant domain (CL domain); wherein: (i) at least one of H1 and L1 comprises at least one amino acid substitution resulting in greater steric complementarity of amino acids when H1 pairs with L1 as compared to L2; (ii) at least one of H2 and L2 comprises at least one amino acid substitution resulting in greater steric complementarity of amino acids when H2 pairs with L2 as compared to L1; (iii) at least one of H1 and L1 comprises at least one amino acid substitution resulting in greater electrostatic complementarity between charged amino acids when H1 pairs with L1 as compared to L2, or (iv) at least one of H2 and L2 comprises at least one amino acid substitution resulting in greater electrostatic complementarity between charged amino acids when H2 pairs with L2 as compared to L1, wherein: a) H1 comprises amino acid substitution at residue 145, L1 comprises amino acid substitutions at residues 124 or 131, and: i) H1 further comprises amino acid substitution at residue 179, L1 comprises amino acid substitution at residue 131, H2 comprises amino acid substitution at residue 186 or at residues 146 and 179, and L2 comprises amino acid substitution at residue 180; ii) H1 further comprises amino acid substitution at residues 186 and 188, H2 comprises amino acid substitution at residue 186, and L2 comprises amino acid substitution at residue 178 or at residues 160 and 180; iii) H1 further comprises amino acid substitution at residues 37, and 179, L1 further comprises amino acid substitution at residue 98, H2 comprises no amino acid substitutions that promote preferential pairing, or H2 comprises amino acid substitution at residue 186, and L2 comprises amino acid substitution at residue 98; or iv) H1 further comprises amino acid substitution at residue 179, L1 comprises amino acid substitution at residue 131, H2 comprises no amino acid substitutions that promote preferential pairing, or H2 comprises amino acid substitution at residue 186, and L2 comprises no amino acid substitutions that promote preferential pairing, or L2 comprises amino acid substitution at residues 124, 160, and 178; b) H1 comprises amino acid substitution at residues 139, 145, 188, and 190; L1 comprises amino acid substitution at residues 116, 133, 176, and 178; H2 comprises amino acid substitution at residues 186 and 188, and L2 comprises amino acid substitution at residues 118, 133, 176, and 180; c) H1 comprises amino acid substitution at residues 139, 188, and 190; L1 comprises amino acid substitution at residues 135, 176, and 178; H2 comprises amino acid substitution at residues 139 and 188, and L2 comprises amino acid substitution at residues 116, 135, and 176, d) H1 comprises amino acid substitution at residue 146 and optionally at residue 179 or 186; L1 comprises amino acid substitution at residue 124; H2 comprises amino acid substitution at residue 145; and L2 comprises amino acid substitution at residue 178, e) H1 comprises amino acid substitution at residues 143 and 145, and i) L1 comprises amino acid substitution at residue 124 or at residues 160 and 178; H2 comprises amino acid substitution at residue 186, and L2 comprises amino acid substitution at residue 124 or at residue 178; ii) L1 comprises amino acid substitution at residue 124; H2 comprises amino acid substitution at residue 143, and L2 comprises amino acid substitution at residue 124, 133 or 178; iii) L1 comprises amino acid substitution at residue 124; H2 comprises no amino acid substitutions that promote preferential pairing, and L2 comprises no amino acid substitutions that promote preferential pairing or L2 comprises amino acid substitution at residue 124; or iv) L1 comprises amino acid substitution at residues 124 or at residues 160 and 178; H2 comprises amino acid substitution at residues 146 and 179, and L2 comprises amino acid substitution at residues 124, 160, and 180, or at residues 124, 160, and 178; f) H1 comprises amino acid substitution at residue 139, and i) L1 comprises amino acid substitution at residue 116; H2 comprises no amino acid substitutions that promote preferential pairing, or H2 comprises amino acid substitution at residue 139, and L2 comprises no amino acid substitutions that promote preferential pairing, or L2 comprises amino acid substitution at residue 135, ii) L1 comprises amino acid substitution at residue 116, H2 comprises amino acid substitution at residue 39 or at residue 139, and L2 comprises amino acid substitution at residue 135, or iii) L1 comprises amino acid substitution at residues 116 and 135, H2 comprises amino acid substitution at residues 139 and 190, and L2 comprises amino acid substitution at residue 135, g) H1 comprises amino acid substitution at residue 39, L1 comprises amino acid substitution at residue 38; H2 comprises amino acid substitution at residue 39, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprises amino acid substitution at residue 38, or comprises no amino acid substitutions that promote preferential pairing, h) H1 and H2 comprise amino acid substitution at residue 124, and i) L1 and L2 comprise amino acid substitution at residues 133 and 176; or ii) L1 comprises amino acid substitution at residue 133, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprises amino acid substitution at residue 133 or at residue 118; i) H1 comprises amino acid substitution at residue 139; L1 comprises amino acid substitution at residue 116, or at residues 118 and 133, or comprises no amino acid substitutions that promote preferential pairing; H2 comprises amino acid substitution at residue 139 or at 190, and L2 comprises amino acid substitution at residue 116, at residue 135, at residues 118 and 133, or comprises no amino acid substitutions that promote preferential pairing, j) H1 comprises amino acid substitution at residue 172; L1 comprises amino acid substitution at residue 174; H2 comprises amino acid substitution at residue 172, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprises no amino acid substitutions that promote preferential pairing, k) H1 comprises amino acid substitution at residues 174 and 188; L1 comprises amino acid substitution at residue 176; H2 comprises amino acid substitution at residue 188, and L2 comprises amino acid substitution at residue 176 or at residue 133, or comprises no amino acid substitutions that promote preferential pairing, l) H1 comprises amino acid substitution at residue 125; L1 comprises amino acid substitution at residue 174 or comprises no amino acid substitutions that promote preferential pairing; H2 comprises amino acid substitution at residue 228, and L2 comprises amino acid substitution at residue 121, m) H1 and H2 comprise amino acid substitution at residue 39 and L1 and L2 comprise amino acid substitution at residue 38, n) H1 comprises amino acid substitution at residue 37 or 45; L1 comprises amino acid substitution at residue 87 or 98; H2 comprises amino acid substitution at residues 37 and 103, and L2 comprises amino acid substitution at residue 44, o) H1 comprises amino acid substitution at residue 37 or at residue 39; L1 comprises amino acid substitution at residue 38 or 98; H2 comprises amino acid substitution at residues 37, 39, and 103, and L2 comprises amino acid substitution at residues 38 and 44, p) H1 comprises amino acid substitution at residue 45; L1 comprises amino acid substitution at residue 44; H2 comprises amino acid substitution at residue 45, or no amino acid substitutions that promote preferential pairing, and L2 comprises no amino acid substitutions that promote preferential pairing, q) H1 comprises amino acid substitution at residue 37 and L1 comprises amino acid substitution at residue 98, and i) H2 comprises amino acid substitution at residues 37 and 100, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprise amino acid substitution at residue 98, or comprises no amino acid substitutions that promote preferential pairing, ii) H2 comprises amino acid substitution at residue 39, and L2 comprises amino acid substitution at residue 38, iii) H2 comprises amino acid substitution at residue 103, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprises amino acid substitution at residue 44 or at residue 98, iv) H2 comprises amino acid substitution at residue 37, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprises amino acid substitution at residue 98, or comprises no amino acid substitutions that promote preferential pairing, v) H2 comprises amino acid substitution at residue 139 or at residues 37 and 103 and L2 comprises amino acid substitution at residue 44 or at residues 98 and 116, vi) H2 comprises amino acid substitution at residue 39, and L2 comprises amino acid substitution at residue 38, vii) H2 comprises amino acid substitution at residue 45, and L2 comprises amino acid substitution at residue 44, viii) H2 comprises amino acid substitution at residue 103, and L2 comprises amino acid substitution at residue 44, or ix) L1 further comprises amino acid substitution at residue 89, H2 comprises amino acid substitution at residue 37, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprises amino acid substitution at residue 98, or comprises no amino acid substitutions that promote preferential pairing, r) H1 comprises amino acid substitution at residue 100; L1 comprises amino acid substitution at residue 98; H2 comprises amino acid substitution at residue 103, or comprises no amino acid substitutions that promote preferential pairing, and L2 comprises amino acid substitution at residue 36, at residue 98, or at residues 44 and 89, or comprises no amino acid substitutions that promote preferential pairing; s) H2 comprises amino acid substitution at residue 39, and L2 comprises amino acid substitution at residue 38, and i) H1 comprises amino acid substitution at residue 39, at residues 37 and 103, and L1 comprises amino acid substitution at residue 38 or 98, or ii) H1 comprises amino acid substitution at residue 39, or at residues 100 and 103; L1 comprises amino acid substitution at residue 98; and L2 further comprises amino acid substitution at residue 98, t) H1 comprises amino acid substitution at residue 37 or at residue 100; L1 comprises amino acid substitution at residue 98; H2 comprises amino acid substitution at residue 37 or 103, and L2 comprises amino acid substitution at residue 44, at residue 89, or comprises no amino acid substitutions that promote preferential pairing, or u) H1 comprises the amino acid substitution 1881, or conservative substitution thereof; L1 and H2 comprise no amino acid substitutions that promote preferential pairing, and L2 comprises the amino acid substitutions 176V and 178L, or conservative substitutions thereof and wherein the numbering of amino acid residues is according to Kabat.
 63. The construct of claim 62, wherein when both L1 and L2 are co-expressed with at least one of H1 and H2, the pairing of H1-L1 to H1-L2 and the pairing of H2-L2 to H2-L1 is greater than the pairing of H1-L1 to H1-L2 and the pairing of H2-L2 to H2-L2 in the absence of the amino acid substitutions.
 64. The construct of claim 63, wherein H1, H2, L1 and L2 are co-expressed in a cell or a mammalian cell, or H1, H2, L1 and L2 are co-expressed in a cell-free expression system, or H1, H2, L1 and L2 are co-produced, or H1, H2, L1 and L2 are co-produced via a redox production system.
 65. The construct of claim 64, wherein the first immunoglobulin light chain polypeptide sequence and/or the second immunoglobulin light chain polypeptide sequence is a kappa light chain polypeptide.
 66. The construct of claim 64, wherein the construct further comprises an Fc comprising two heavy chain constant domain polypeptides, each comprising a CH3 sequence, wherein the heavy chain polypeptides are coupled, with or without one or more linkers, to the first heterodimer and the second heterodimer.
 67. The construct of claim 66, wherein the Fc is a human Fc, a human IgG1 Fc, a human IgA Fc, a human IgG Fc, a human IgD Fc, a human IgE Fc, a human IgM Fc, a human IgG2 Fc, a human IgG3 Fc, or a human IgG4 Fc.
 68. The construct of claim 67, wherein the Fc is a heterodimeric Fc.
 69. The construct of claim 68, wherein the Fc comprises one or more modifications in at least one of the CH3 sequences that promote the formation of a heterodimeric Fc with stability comparable to a wild-type homodimeric Fc.
 70. The construct of claim 69, wherein: a) one of the CH3 sequences comprises the amino acid substitutions L351Y, F405A, Y407V and the other comprises the amino acid substitutions T366L, K392M, T394W; b) one of the CH3 sequences comprises the amino acid substitutions L351Y, F405A, Y407V and the other comprises the amino acid substitutions T366L, K392L, T394W; c) one of the CH3 sequences comprises the amino acid substitutions T350V, L351Y, F405A, Y407V and the other comprises the amino acid substitutions T350V, T366L, K392M, T394W; d) one of the CH3 sequences comprises the amino acid substitutions T350V, L351Y, F405A, Y407V and the other comprises the amino acid substitutions T350V, T366L, K392L, T394W; or e) one of the CH3 sequences comprises the amino acid substitutions T350V, L351Y, S400E, F405A, Y407V and the other comprises the amino acid substitutions T350V, T366L, N390R, K392M, T394W, wherein the numbering of amino acid residues of the CH3 sequences is according to the EU numbering system.
 71. The construct of claim 70, wherein the Fc further comprises at least one CH2 sequence.
 72. The construct of claim 71, wherein the Fc comprises one or more modifications to promote selective binding of Fc-gamma receptors.
 73. The construct of claim 66, wherein the one or more linkers are one or more polypeptide linkers, comprising one or more antibody hinge regions or one or more IgG1 hinge regions.
 74. The construct according to claim 62, wherein the sequences of each of H1, H2, L1, and L2 are derived from human or humanized sequences.
 75. The construct according to claim 62, wherein the construct is multispecific or bispecific.
 76. The construct according to claim 70, wherein the construct is conjugated to a therapeutic agent or drug.
 77. A pharmaceutical composition comprising the construct of claim 62 and a pharmaceutically acceptable carrier.
 78. The pharmaceutical composition according to claim 77, further comprising one or more substances selected from the group consisting of a buffer, an antioxidant, a low molecular weight molecule, a drug, a protein, an amino acid, a carbohydrate, a lipid, a chelating agent, a stabilizer, and an excipient.
 79. An isolated polynucleotide or set of isolated polynucleotides comprising at least one sequence that encodes the construct according to claim
 62. 80. The isolated polynucleotide of claim 79, wherein the polynucleotide or set of polynucleotides is cDNA.
 81. A vector or set of vectors comprising the isolated polynucleotide or set of isolated polynucleotides according to claim
 79. 82. The vector or set of vectors according to claim 81, which is selected from the group consisting of a plasmid, a multi-cistronic vector, a viral vector, a non-episomal mammalian vector, an expression vector, and a recombinant expression vector.
 83. An isolated cell comprising the vector or set of vectors according to claim
 81. 84. The isolated cell according to claim 83, wherein the cell is a hybridoma, a Chinese Hamster Ovary (CHO) cell, or a HEK293 cell.
 85. A method of obtaining the construct according to claim 62 from a host cell culture, the method comprising the steps of: (a) obtaining a host cell culture comprising at least one host cell comprising one or more nucleic acid sequences encoding the construct; and (b) recovering the construct from the host cell culture.
 86. The method of claim 85, wherein the construct comprises an Fc comprising at least two CH3 sequences, wherein the Fc is coupled, with or without one or more linkers, to the first heterodimer and the second heterodimer.
 87. The method of claim 86, wherein the Fc is a heterodimer, optionally comprising one or more amino acid modifications.
 88. A method of treating a subject having a disease or disorder or cancer or vascular disease comprising administering the construct according to claim
 62. 